This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
nvfp4-megamoe-kernel
Watch
1
Star
0
Fork
0
You've already forked nvfp4-megamoe-kernel
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
fae61d3ef772aa14b41f425cf3851c7bfa7fdaf6
nvfp4-megamoe-kernel
/
dsv4
/
kernels
History
biondizzle
fae61d3ef7
Add c10/cuda/CUDAStream.h include for getCurrentCUDAStream
2026-06-04 04:13:40 +00:00
..
attention
Wire prefill FMHA into production.py and single_shot
2026-06-03 03:49:57 +00:00
cache
Cleanup Step 2: Archive Lineage P code, fix broken imports
2026-06-02 19:27:07 +00:00
compressor
Cleanup Step 2: Archive Lineage P code, fix broken imports
2026-06-02 19:27:07 +00:00
cuda
Add c10/cuda/CUDAStream.h include for getCurrentCUDAStream
2026-06-04 04:13:40 +00:00
gemm
Blackwell swizzle CUDA kernel for CUDA graph capture
2026-06-04 03:03:02 +00:00
indexer
Cleanup Step 2: Archive Lineage P code, fix broken imports
2026-06-02 19:27:07 +00:00
router
Cleanup Step 2: Archive Lineage P code, fix broken imports
2026-06-02 19:27:07 +00:00
__init__.py
Restructure: cutedsl/ -> dsv4/ with proper layering
2026-05-21 17:30:44 +00:00