This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
nvfp4-megamoe-kernel
Watch
1
Star
0
Fork
0
You've already forked nvfp4-megamoe-kernel
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
a5fabbdf66f55efd61bbae135925f1dcd0698017
nvfp4-megamoe-kernel
/
vllm
History
biondizzle
a5fabbdf66
Apply RoPE to KV in Blackwell attention path - fix NaN output
2026-05-19 10:27:15 +00:00
..
kernels/linear
/nvfp4
Fix OOM: add --max-model-len=876544 + revert CPU dummy weight
2026-05-19 07:35:43 +00:00
patches
Apply RoPE to KV in Blackwell attention path - fix NaN output
2026-05-19 10:27:15 +00:00
cutedsl_quant_method.py
Fix OOM: add --max-model-len=876544 + revert CPU dummy weight
2026-05-19 07:35:43 +00:00
nvfp4_cutedsl.py
Replace autograd.Function with torch.library.custom_op for Dynamo compat
2026-05-19 01:54:48 +00:00