Files
DeepGEMM/deep_gemm
biondizzle bfe612969b fix: preserve MN-major layout when interleaving L1 SF tensors
_interleave_l1_weights used empty_like+copy_ which destroyed the
MN-major stride layout required by TMA. Added interleave_sf_mn_major
that works in K-major, interleaves, then transposes back to MN-major.
2026-05-12 14:01:58 +00:00
..