biondizzle/nvfp4-megamoe-kernel
cea453cbabe88fd141fee13a12342ce2d0fa818e
nvfp4-megamoe-kernel/vllm
biondizzle
255913fba4
Vectorize paged KV cache read/write, kill container
2026-05-19 15:48:16 +00:00
kernels/linear/nvfp4
Fix OOM: add --max-model-len=876544 + revert CPU dummy weight
2026-05-19 07:35:43 +00:00
patches
Vectorize paged KV cache read/write, kill container
2026-05-19 15:48:16 +00:00
cutedsl_quant_method.py
Fix OOM: add --max-model-len=876544 + revert CPU dummy weight
2026-05-19 07:35:43 +00:00
nvfp4_cutedsl.py
Replace autograd.Function with torch.library.custom_op for Dynamo compat
2026-05-19 01:54:48 +00:00