This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
nvfp4-megamoe-kernel
Watch
1
Star
0
Fork
0
You've already forked nvfp4-megamoe-kernel
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
4300775bfec557d514e52ed00459bef6fdff161a
nvfp4-megamoe-kernel
/
cutedsl
History
biondizzle
5a79065b2b
fix: GEMM output should be 2x packed N (float4_e2m1fn_x2 packs 2 per element)
2026-05-16 18:27:44 +00:00
..
kernel
refactor: copy CuTeDSL kernel into repo with local imports
2026-05-16 02:57:54 +00:00
__init__.py
refactor: copy CuTeDSL kernel into repo with local imports
2026-05-16 02:57:54 +00:00
bridge.py
fix: GEMM output should be 2x packed N (float4_e2m1fn_x2 packs 2 per element)
2026-05-16 18:27:44 +00:00
moe_pipeline.py
fix: same gate/up split fix in moe_pipeline.py
2026-05-16 04:04:53 +00:00