This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
nvfp4-megamoe-kernel
Watch
1
Star
0
Fork
0
You've already forked nvfp4-megamoe-kernel
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
42c5793add89c837d76782b4eca3f64d14ebd3ea
nvfp4-megamoe-kernel
/
dsv4
History
biondizzle
42c5793add
D1.5: Add isolated round-trip test comparing s_k=128 vs s_k=256 with NOOP rescale
2026-05-26 20:45:58 +00:00
..
cache
Flush compressor: schema fix, prepare_forward, flush_write kernels, state rotation
2026-05-22 00:25:47 +00:00
kernels
D1.5: Add isolated round-trip test comparing s_k=128 vs s_k=256 with NOOP rescale
2026-05-26 20:45:58 +00:00
layers
NVFP4-1.1 integration: GPU-only quantize kernel + MoE pipeline wiring
2026-05-25 16:19:07 +00:00
loader
Restructure: cutedsl/ -> dsv4/ with proper layering
2026-05-21 17:30:44 +00:00
model
Fix layer construction: match existing API signatures, add RMSNorm impl
2026-05-21 23:31:58 +00:00
ops
NVFP4-3: add use_2cta_instrs conditional to gemm_runner
2026-05-25 16:42:02 +00:00
reference
Restructure: cutedsl/ -> dsv4/ with proper layering
2026-05-21 17:30:44 +00:00
__init__.py
Restructure: cutedsl/ -> dsv4/ with proper layering
2026-05-21 17:30:44 +00:00