You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
During quantization (using pytorch_quantization), the qparams (scale and zero_point) of old Conv is computed using Calibrator. However, when the Conv and Batch Normalization (BN) layers are fused, the weights and biases of the fused Conv change. In this case, the original qparams may not be applicable anymore. Could you please explain how to correctly determine the new qparams (scale and zero_point) after this fusion?
The text was updated successfully, but these errors were encountered:
So the weight range after fusion changes, trt compiler will recalculate the dynamic range of the fused weights 𝑊' values (such as max/histogram calibrator method) through statistical analysis., that is w_Q, w_DQ will change.
During quantization (using pytorch_quantization), the qparams (scale and zero_point) of old Conv is computed using Calibrator. However, when the Conv and Batch Normalization (BN) layers are fused, the weights and biases of the fused Conv change. In this case, the original qparams may not be applicable anymore. Could you please explain how to correctly determine the new qparams (scale and zero_point) after this fusion?
The text was updated successfully, but these errors were encountered: