biondizzle/nvfp4-megamoe-kernel
efe63caea97366fc6f6ffb184e64ada38eef8af7
nvfp4-megamoe-kernel/dsv4/kernels/router
biondizzle
62efde5c9f
fix: router — use cuBLAS BF16 GEMM + activation_topk CUDA kernel (production path, not CuTeDSL fused)
2026-06-01 01:01:15 +00:00
__init__.py
Router: clean up dense_router_decode.py — realistic architecture, no fake code
2026-05-21 21:58:31 +00:00
_activation_topk.py
Router: full kernel stack — hash, topk, activation+topk, dense decode/prefill
2026-05-21 21:54:05 +00:00
dense_router_decode_kernel.py
fix: router kernel — infer OperandMajorMode from tensor layout (same pattern as MoE GEMM)
2026-06-01 00:59:18 +00:00
dense_router_decode.py
fix: router — use cuBLAS BF16 GEMM + activation_topk CUDA kernel (production path, not CuTeDSL fused)
2026-06-01 01:01:15 +00:00
dense_router_prefill.py
Router: clean up dense_router_decode.py — realistic architecture, no fake code
2026-05-21 21:58:31 +00:00