This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
nvfp4-megamoe-kernel
Watch
1
Star
0
Fork
0
You've already forked nvfp4-megamoe-kernel
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
bcfbd1e25b5fa7943014c4c8aab5820b28fae782
nvfp4-megamoe-kernel
/
vllm
History
biondizzle
e91421f06e
Fix KV cache page size patch: separate groups for large SWA pages
2026-05-19 09:05:14 +00:00
..
kernels/linear
/nvfp4
Fix OOM: add --max-model-len=876544 + revert CPU dummy weight
2026-05-19 07:35:43 +00:00
patches
Fix KV cache page size patch: separate groups for large SWA pages
2026-05-19 09:05:14 +00:00
cutedsl_quant_method.py
Fix OOM: add --max-model-len=876544 + revert CPU dummy weight
2026-05-19 07:35:43 +00:00
nvfp4_cutedsl.py
Replace autograd.Function with torch.library.custom_op for Dynamo compat
2026-05-19 01:54:48 +00:00