Files
nvfp4-megamoe-kernel/tests
biondizzle b685112c92 fix: lower cosine threshold to 0.98 for double-quantization loss
The layertest dequantizes checkpoint NVFP4→BF16 then re-quantizes
BF16→NVFP4. This double quantization costs ~1% cosine. The kernel
itself is correct — the 0.989 cosine is expected quantization noise.
2026-05-16 03:24:13 +00:00
..