This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
nvfp4-megamoe-kernel
Watch
1
Star
0
Fork
0
You've already forked nvfp4-megamoe-kernel
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
0ab5d8c317bc65aaa044401c8f430ffca5da38f1
nvfp4-megamoe-kernel
/
dsv4
/
kernels
History
biondizzle
0ab5d8c317
fix: disable broken CuTeDSL fused router — use BF16 linear + activation_topk (both are production paths)
2026-06-01 00:56:00 +00:00
..
attention
FMHA sink: don't double-scale sink bias
2026-05-31 23:12:20 +00:00
cache
fix: correct gather.py kernel_dir path
2026-05-30 21:12:09 +00:00
compressor
Wire indexer compute_index_scores_topk + fix compressor imports
2026-05-30 21:19:06 +00:00
cuda
fix: extern declarations for gather_swa functions in gather_kv.cu
2026-05-30 21:14:15 +00:00
gemm
NVFP4-1.1: Mark fp4_quant.py as toolchain-blocked, clean up test files
2026-05-28 04:59:01 +00:00
indexer
Wire indexer compute_index_scores_topk + fix compressor imports
2026-05-30 21:19:06 +00:00
router
fix: disable broken CuTeDSL fused router — use BF16 linear + activation_topk (both are production paths)
2026-06-01 00:56:00 +00:00
__init__.py
Restructure: cutedsl/ -> dsv4/ with proper layering
2026-05-21 17:30:44 +00:00