Many thanks for the great work!
I have a question about the accuracy drop reported in the paper when the Hadamard transform (HT) is applied to NVFP4 with RTN. Is it because the Hadamard block size is too small (16, vs. 32 for INT4 and MXFP4), so that the outlier-smoothing effect of the HT is weakened? Also, could randomization (i.e. a randomized Hadamard transform, RHT) help here?
Furthermore, would you consider decreasing the quantization block size / Hadamard transform size from 32 to 16 for INT4 as an additional experiment to verify this?
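
To make the intuition concrete, here is a minimal NumPy sketch of what I mean. It is my own toy illustration, not the paper's code: the helper names `hadamard` and `block_hadamard`, the 64-element input, and the single outlier of magnitude 8 are all assumptions chosen for clarity. It applies a normalized block-wise Hadamard transform and compares the peak magnitude left after a 16-wide versus a 32-wide transform.

```python
import numpy as np

def hadamard(n):
    """Normalized Sylvester Hadamard matrix; n must be a power of two."""
    assert n > 0 and (n & (n - 1)) == 0, "n must be a power of two"
    H = np.array([[1.0]])
    while H.shape[0] < n:
        H = np.block([[H, H], [H, -H]])
    return H / np.sqrt(n)  # orthonormal, so the transform preserves the L2 norm

def block_hadamard(x, block, signs=None):
    """Apply an independent Hadamard transform to each contiguous block of x.

    If `signs` (one +/-1 per element) is given, the input is sign-flipped
    first, which is the randomized variant (RHT) in this toy setup.
    """
    x = np.asarray(x, dtype=np.float64)
    assert x.size % block == 0, "length must be a multiple of the block size"
    if signs is not None:
        x = x * signs
    return (x.reshape(-1, block) @ hadamard(block).T).reshape(-1)

# Toy input (assumption): zeros everywhere except one outlier.
x = np.zeros(64)
x[5] = 8.0

peak16 = np.abs(block_hadamard(x, 16)).max()  # outlier spread over 16 values
peak32 = np.abs(block_hadamard(x, 32)).max()  # outlier spread over 32 values

# Randomized variant with a fixed seed, for comparison.
rng = np.random.default_rng(0)
signs = rng.choice([-1.0, 1.0], size=x.size)
peak16_rht = np.abs(block_hadamard(x, 16, signs)).max()

print(peak16, peak32, peak16_rht)
```

In this toy case a single outlier of magnitude 8 is reduced to 8/√16 = 2 with a 16-wide transform and to 8/√32 ≈ 1.41 with a 32-wide one, which is the weakening I am asking about. The random signs do not change the magnitudes for a single isolated outlier, so any benefit from RHT would have to show up on more structured inputs such as real activations.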
Kind regards