This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
nvfp4-megamoe-kernel
Watch
1
Star
0
Fork
0
You've already forked nvfp4-megamoe-kernel
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
da6fa2f1d6312c111fcfe7684e286d9de915f9c0
nvfp4-megamoe-kernel
/
vllm
History
biondizzle
da6fa2f1d6
Fix UnboundLocalError: move num_decode_tokens before debug print
2026-05-19 16:43:28 +00:00
..
kernels/linear
/nvfp4
Fix OOM: add --max-model-len=876544 + revert CPU dummy weight
2026-05-19 07:35:43 +00:00
patches
Fix UnboundLocalError: move num_decode_tokens before debug print
2026-05-19 16:43:28 +00:00
cutedsl_quant_method.py
Fix OOM: add --max-model-len=876544 + revert CPU dummy weight
2026-05-19 07:35:43 +00:00
nvfp4_cutedsl.py
Replace autograd.Function with torch.library.custom_op for Dynamo compat
2026-05-19 01:54:48 +00:00