-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question about Dual-Stream Information Exchange in MP-SENet #9
Comments
Thank you for your interest in our previous works. In fact, MP-SENet does not employ a dual-stream structure; |
Thank you for your response. |
Oh, and one more question! Would your AP-BWE model work well for a parallel decoder-only speech enhancement task, similar to PHASEN? |
Sorry for the delayed reply, I believe the AP-BWE framework can handle the SE task by just modifying the magnitude stream to a masking or mapping-based architecture. But I think it won't have such strong SE capabilities as the AP-BWE doesn't employ the Transformers to capture long-term dependencies, which is important for handling the time-variant noise in noisy signals. |
Thank you for the detailed and helpful response! I just have one last question. As far as I understand, you used a ConvNeXt block as the backbone, and converted the original 2D model into a 1D version. |
Hello,
I’m really impressed by your approach to directly estimating phase, and I truly appreciate the great work you’ve been consistently publishing.
I wanted to ask about your previous paper, MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra. In that work, was there a specific reason why you didn’t adopt PHASEN’s dual-stream information exchange mechanism?
Looking forward to your thoughts!
Best regards,
The text was updated successfully, but these errors were encountered: