This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
DeepGEMM
Watch
1
Star
0
Fork
0
You've already forked DeepGEMM
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
c1cbe488f3cd91ce984a5328eb04afa122a7763c
DeepGEMM
/
deep_gemm
History
biondizzle
c1cbe488f3
diag: force a_format/b_format=5 (MXF8F6F4Format::E2M1), re-enable MMA, dump k=0+k=1
2026-05-12 19:06:28 +00:00
..
include
/deep_gemm
diag: force a_format/b_format=5 (MXF8F6F4Format::E2M1), re-enable MMA, dump k=0+k=1
2026-05-12 19:06:28 +00:00
legacy
[Public release 26/04] Introducing Mega MoE, FP4 Indexer and other features/fixes (
#304
)
2026-04-17 09:45:14 +08:00
mega
fix: no GPU tensor ops in crash handler (CUDA is broken after 715)
2026-05-12 16:20:11 +00:00
testing
[Public release 26/04] Introducing Mega MoE, FP4 Indexer and other features/fixes (
#304
)
2026-04-17 09:45:14 +08:00
utils
[Public release 26/04] Introducing Mega MoE, FP4 Indexer and other features/fixes (
#304
)
2026-04-17 09:45:14 +08:00
__init__.py
Add various optimizations and Mega MoE benchmarks (
#316
)
2026-04-24 18:41:37 +08:00