biondizzle/DeepGEMM
DeepGEMM/csrc @ 4442c06ba85c027bb86eca283ce7362f922a3d9f
Latest commit: biondizzle, 2c09545faa, "diag: force block_m=128 to test UMMA_N=192 validity for mxf4nvf4", 2026-05-12 19:37:11 +00:00
| Name | Last commit | Date |
|---|---|---|
| apis | fix: add kInt8 dtype support to TMA descriptor + change activation tensors to kInt8 | 2026-05-11 22:54:47 +00:00 |
| indexing | [Public release 26/04] Introducing Mega MoE, FP4 Indexer and other features/fixes (#304) | 2026-04-17 09:45:14 +08:00 |
| jit | fix: gran_k=16 in transform_sf + sm_100a arch for NVFP4 mega_moe | 2026-05-11 16:11:11 +00:00 |
| jit_kernels | diag: force block_m=128 to test UMMA_N=192 validity for mxf4nvf4 | 2026-05-12 19:37:11 +00:00 |
| utils | [Public release 26/04] Introducing Mega MoE, FP4 Indexer and other features/fixes (#304) | 2026-04-17 09:45:14 +08:00 |
| python_api.cpp | feat: NVFP4 mega MoE kernel (scale_vec::4X, UE4M3 block scales) | 2026-05-11 05:41:08 +00:00 |