Files
nvfp4-megamoe-kernel/dsv4
biondizzle ec8f292112 Fix: use self.mma_tiler_mnk (full K=64) for SMEM layout computation
SFA/SFB SMEM layouts need the full K dimension to compute the correct
number of K-tiles. self.mma_tiler has K=1 (placeholder for cute.slice_)
which gives 0 K-tiles and zero-dimension SMEM shapes.
2026-06-01 11:03:08 +00:00
..