This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
DeepGEMM
Watch
1
Star
0
Fork
0
You've already forked DeepGEMM
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
b0094175a2fe8708a9a86575c2ed2846ccc05f20
DeepGEMM
/
csrc
History
biondizzle
b0094175a2
fix: restore elem_size declaration for TMA desc build
2026-05-12 17:40:25 +00:00
..
apis
fix: add kInt8 dtype support to TMA descriptor + change activation tensors to kInt8
2026-05-11 22:54:47 +00:00
indexing
[Public release 26/04] Introducing Mega MoE, FP4 Indexer and other features/fixes (
#304
)
2026-04-17 09:45:14 +08:00
jit
fix: gran_k=16 in transform_sf + sm_100a arch for NVFP4 mega_moe
2026-05-11 16:11:11 +00:00
jit_kernels
fix: restore elem_size declaration for TMA desc build
2026-05-12 17:40:25 +00:00
utils
[Public release 26/04] Introducing Mega MoE, FP4 Indexer and other features/fixes (
#304
)
2026-04-17 09:45:14 +08:00
python_api.cpp
feat: NVFP4 mega MoE kernel (scale_vec::4X, UE4M3 block scales)
2026-05-11 05:41:08 +00:00