Files
DeepGEMM/deep_gemm/mega
biondizzle e498a2c729 fix: single transpose back to MN-major, don't double-transpose
The .contiguous().transpose() dance was swapping dims back.
A single transpose from (g,k,mn) gives (g,mn,k) with stride(-2)=1,
which is exactly the MN-major layout TMA expects.
2026-05-12 14:23:02 +00:00
..