biondizzle/nvfp4-megamoe-kernel
0ab5d8c317bc65aaa044401c8f430ffca5da38f1
nvfp4-megamoe-kernel/dsv4
biondizzle
0ab5d8c317
fix: disable broken CuTeDSL fused router — use BF16 linear + activation_topk (both are production paths)
2026-06-01 00:56:00 +00:00
| Name | Last commit | Date |
|---|---|---|
| cache | E1: Wire LayerCacheHandle gather methods + CUDA gather kernels | 2026-05-30 21:09:21 +00:00 |
| kernels | fix: disable broken CuTeDSL fused router — use BF16 linear + activation_topk (both are production paths) | 2026-06-01 00:56:00 +00:00 |
| layers | fix: transpose checkpoint weights before make_b_k_major in Nvfp4Linear/SharedExpert | 2026-06-01 00:30:37 +00:00 |
| loader | Restructure: cutedsl/ -> dsv4/ with proper layering | 2026-05-21 17:30:44 +00:00 |
| model | Fix remaining mHC API references: layer_compare.py, layer.py comment | 2026-05-31 18:38:34 +00:00 |
| ops | fix: import SF_VEC_SIZE from quantize in gemm_runner (was NameError) | 2026-06-01 00:04:48 +00:00 |
| reference | Restructure: cutedsl/ -> dsv4/ with proper layering | 2026-05-21 17:30:44 +00:00 |
| __init__.py | Restructure: cutedsl/ -> dsv4/ with proper layering | 2026-05-21 17:30:44 +00:00 |