This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
nvfp4-megamoe-kernel
Watch
1
Star
0
Fork
0
You've already forked nvfp4-megamoe-kernel
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
eef0ef76afac623f7f2b636db389f1111e80045f
nvfp4-megamoe-kernel
/
cutedsl
History
biondizzle
b007937a68
Fix garbled imports in cutedsl/runner.py
2026-05-18 22:22:52 +00:00
..
kernel
refactor: copy CuTeDSL kernel into repo with local imports
2026-05-16 02:57:54 +00:00
__init__.py
refactor: copy CuTeDSL kernel into repo with local imports
2026-05-16 02:57:54 +00:00
bridge.py
Fix torch.compile crash: remove threading.Lock from LUT cache path
2026-05-18 20:54:55 +00:00
moe_pipeline.py
Add pipeline test with real model weights, add swiglu_limit to reference moe_pipeline
2026-05-17 18:07:44 +00:00
nvfp4_linear.py
Fix torch.compile: use custom autograd Function instead of @torch.compiler.disable
2026-05-18 21:38:28 +00:00
runner.py
Fix garbled imports in cutedsl/runner.py
2026-05-18 22:22:52 +00:00
shared_expert_pipeline.py
Fix torch.compile: use custom autograd Function instead of @torch.compiler.disable
2026-05-18 21:38:28 +00:00