Files
nvfp4-megamoe-kernel/tests/unit
biondizzle fae9f6fbb5 Reset to working_softmax_maybe.py + TMA fix only
Previous O rescale attempt broke n=128 (0.464773).
Revert to known-good softmax code, only apply TMA fix:
tBgK[(None,None,0,0)] → tBgK[(None,0,None,0)]

Expected: n=128 cos 0.999998 (same as working), n=256 cos 0.71 (TMA fix loads 2 tiles but no O rescale)
2026-05-23 00:27:41 +00:00
..