nvfp4-megamoe-kernel/src/nvfp4_megamoe_kernel
biondizzle 3ba41b9322 fix: use slot_token identity check instead of shape heuristic for gather
The shape-based check (x_fp4.shape[0] != num_slots) silently fails when
num_tokens == num_slots in L1 (topk=1). The gather path now checks whether
slot_token is the identity mapping and gathers only when slot ordering
differs from token ordering.
2026-05-15 10:00:41 +00:00
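
The failure mode described in the commit message can be illustrated with a minimal sketch. It uses plain Python lists in place of tensors, and the helper names (`needs_gather_by_shape`, `is_identity`, `gather_rows`) are hypothetical, not taken from the kernel source. Only `slot_token` and the shape comparison come from the commit message. The length comparison inside `is_identity` is an added assumption.

```python
def needs_gather_by_shape(num_tokens, num_slots):
    # Old heuristic (per the commit message): gather only when row counts differ.
    return num_tokens != num_slots


def is_identity(slot_token, num_tokens):
    # Assumption: "identity" means slot i reads token i for every slot,
    # and there are exactly as many slots as tokens.
    return len(slot_token) == num_tokens and all(
        tok == i for i, tok in enumerate(slot_token)
    )


def gather_rows(x, slot_token):
    # Reorder token rows into slot order; skip the copy for the identity mapping.
    if is_identity(slot_token, len(x)):
        return x
    return [x[tok] for tok in slot_token]


# topk=1 with three tokens: num_slots == num_tokens, but slots are permuted.
x = ["t0", "t1", "t2"]
slot_token = [2, 0, 1]

shape_says_gather = needs_gather_by_shape(len(x), len(slot_token))
gathered = gather_rows(x, slot_token)
```

Here the shape heuristic reports that no gather is needed because the row counts match, even though the slot order differs from the token order. The identity check catches the permutation and reorders the rows.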