fix reduction mistake in SpectralConvergenceLoss #75

renared · 2024-05-22T17:29:41Z

I noticed that when evaluating the STFT loss over my validation dataset, I obtained different results in function of the batch size. I could isolate the cause to be the spectral convergence term, then came across the comment by @egaznep in issue #69. It does not make sense to average the denominator over all dimensions including the batch dimension, so I believe their suggestion should be used instead.

This snippet shows the difference:

import torch
from auraloss.freq import STFTLoss

batches = [(torch.randn(4, 1, 16384), torch.randn(4, 1, 16384)) for i in range(1024)]
batchall = tuple(torch.concat(u, dim=0) for u in zip(*batches))

print("with spectral convergence enabled")
loss = STFTLoss()
print("mean of losses:", torch.mean(torch.tensor(tuple(loss(*batch) for batch in batches))))
print("over full dataset:", loss(*batchall))

print("with spectral convergence disabled")
loss = STFTLoss(w_sc=0)
print("mean of losses:", torch.mean(torch.tensor(tuple(loss(*batch) for batch in batches))))
print("over full dataset:", loss(*batchall))

Before:

with spectral convergence enabled
mean of losses: tensor(1.3511)
over full dataset: tensor(1.3493)
with spectral convergence disabled
mean of losses: tensor(0.6950)
over full dataset: tensor(0.6950)

After:

with spectral convergence enabled
mean of losses: tensor(1.3726)
over full dataset: tensor(1.3726)
with spectral convergence disabled
mean of losses: tensor(0.7095)
over full dataset: tensor(0.7095)

@egaznep

the denominator was averaged over all dimensions including the batch dimension, see comment by @egaznep in csteinmetz1#69

cpvlordelo · 2025-02-24T10:39:40Z

I just stumbled on the exact same problem. Is there any plans on merging this fix? Ping @csteinmetz1?

cpvlordelo · 2025-02-24T17:09:34Z

auraloss/freq.py

@@ -16,7 +15,7 @@ def __init__(self):
        super(SpectralConvergenceLoss, self).__init__()

    def forward(self, x_mag, y_mag):
-        return torch.norm(y_mag - x_mag, p="fro") / torch.norm(y_mag, p="fro")
+        return torch.norm(y_mag - x_mag, p="fro", dim=(-1, -2), keepdim=True) / torch.norm(y_mag, p="fro", dim=(-1, -2), keepdim=True)


Suggested change

return torch.norm(y_mag - x_mag, p="fro", dim=(-1, -2), keepdim=True) / torch.norm(y_mag, p="fro", dim=(-1, -2), keepdim=True)

return (torch.norm(y_mag - x_mag, p="fro", dim=(-1, -2)) / torch.norm(y_mag, p="fro", dim=(-1, -2))).mean()

Since you removed the reduction, this is now returning a multi-dimensional tensor. It does work with STFTLoss because the reduction is done inside of it as you can see here, but if you instantiate SpectralConvergenceLoss, on the other hand, then your example code there will crash.

import torch from auraloss.freq import SpectralConvergenceLoss batches = [(torch.randn(4, 1, 16384), torch.randn(4, 1, 16384)) for i in range(1024)] batchall = tuple(torch.concat(u, dim=0) for u in zip(*batches)) loss = SpectralConvergenceLoss() print("Shape of Spectral Convergence Loss over full dataset:", loss(*batchall).shape) print("mean of losses:", torch.mean(torch.tensor(tuple(loss(*batch) for batch in batches))))

Before:

Shape of Spectral Convergence Loss over full dataset: torch.Size([]) mean of losses: tensor(1.4144)

After:

Shape of Spectral Convergence Loss full dataset: torch.Size([4096, 1, 1]) --------------------------------------------------------------------------- ValueError Traceback (most recent call last) [<ipython-input-45-952d11dbfe6a>](https://localhost:8080/#) in <cell line: 0>() 23 print("Shape of Spectral Convergence Loss over full dataset:", loss(*batchall).shape) ---> 24 print("mean of losses:", torch.mean(torch.tensor(tuple(loss(*batch) for batch in batches)))) ValueError: only one element tensors can be converted to Python scalars

This is just a suggestion that will always perform the reduction as mean.

But an even better option, in my opinion, would be to add a new string argument reduction as part of init and call apply_reduction inside this forward method in a similar way done in STFTLoss code.

fix reduction mistake in SpectralConvergenceLoss

d6f861f

the denominator was averaged over all dimensions including the batch dimension, see comment by @egaznep in csteinmetz1#69

cpvlordelo reviewed Feb 24, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix reduction mistake in SpectralConvergenceLoss #75

fix reduction mistake in SpectralConvergenceLoss #75

renared commented May 22, 2024

cpvlordelo commented Feb 24, 2025

cpvlordelo Feb 24, 2025 •

edited

Loading

	return torch.norm(y_mag - x_mag, p="fro", dim=(-1, -2), keepdim=True) / torch.norm(y_mag, p="fro", dim=(-1, -2), keepdim=True)
	return (torch.norm(y_mag - x_mag, p="fro", dim=(-1, -2)) / torch.norm(y_mag, p="fro", dim=(-1, -2))).mean()

fix reduction mistake in SpectralConvergenceLoss #75

Are you sure you want to change the base?

fix reduction mistake in SpectralConvergenceLoss #75

Conversation

renared commented May 22, 2024

cpvlordelo commented Feb 24, 2025

cpvlordelo Feb 24, 2025 • edited Loading

Choose a reason for hiding this comment

cpvlordelo Feb 24, 2025 •

edited

Loading