Question about Dual-Stream Information Exchange in MP-SENet #9

EuiYeonKim · 2025-02-14T04:10:15Z

Hello,

I’m really impressed by your approach to directly estimating phase, and I truly appreciate the great work you’ve been consistently publishing.

I wanted to ask about your previous paper, MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra. In that work, was there a specific reason why you didn’t adopt PHASEN’s dual-stream information exchange mechanism?

Looking forward to your thoughts!

Best regards,

yxlu-0102 · 2025-02-15T12:18:00Z

Thank you for your interest in our previous works.

In fact, MP-SENet does not employ a dual-stream structure;
the magnitude and phase parts share the encoder and Transformer blocks, and only diverge in the decoders.
We believe that the magnitude and phase information is already integrated in the previous blocks, so there is no need for an additional information interaction mechanism.

EuiYeonKim · 2025-02-18T05:52:33Z

Thank you for your response.
I understand that your point is that since there was already sufficient information exchange between amplitude and phase earlier, the parallel decoder does not need to exchange information later.
Your answer was very helpful. I will now close this issue.

EuiYeonKim · 2025-02-20T05:49:09Z

Oh, and one more question! Would your AP-BWE model work well for a parallel decoder-only speech enhancement task, similar to PHASEN?

yxlu-0102 · 2025-03-13T14:09:54Z

Sorry for the delayed reply, I believe the AP-BWE framework can handle the SE task by just modifying the magnitude stream to a masking or mapping-based architecture.

But I think it won't have such strong SE capabilities as the AP-BWE doesn't employ the Transformers to capture long-term dependencies, which is important for handling the time-variant noise in noisy signals.

EuiYeonKim · 2025-03-24T06:25:50Z

Thank you for the detailed and helpful response!

I just have one last question. As far as I understand, you used a ConvNeXt block as the backbone, and converted the original 2D model into a 1D version.
Could you please explain the reasoning behind this design choice?

EuiYeonKim closed this as completed Feb 18, 2025

EuiYeonKim reopened this Feb 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about Dual-Stream Information Exchange in MP-SENet #9

Question about Dual-Stream Information Exchange in MP-SENet #9

EuiYeonKim commented Feb 14, 2025

yxlu-0102 commented Feb 15, 2025

EuiYeonKim commented Feb 18, 2025

EuiYeonKim commented Feb 20, 2025

yxlu-0102 commented Mar 13, 2025

EuiYeonKim commented Mar 24, 2025

Question about Dual-Stream Information Exchange in MP-SENet #9

Question about Dual-Stream Information Exchange in MP-SENet #9

Comments

EuiYeonKim commented Feb 14, 2025

yxlu-0102 commented Feb 15, 2025

EuiYeonKim commented Feb 18, 2025

EuiYeonKim commented Feb 20, 2025

yxlu-0102 commented Mar 13, 2025

EuiYeonKim commented Mar 24, 2025