nvfp4-megamoe-kernel/vllm at 7e97551fd3dd9fcc524e708e347a7defdce7004d - nvfp4-megamoe-kernel - Gitea: Git with a cup of tea

biondizzle/nvfp4-megamoe-kernel

Files

History

biondizzle 7e97551fd3 Fix: use self.scale instead of self.softmax_scale in Blackwell attention path

2026-05-19 10:04:46 +00:00

..

kernels/linear/nvfp4

Fix OOM: add --max-model-len=876544 + revert CPU dummy weight

2026-05-19 07:35:43 +00:00

Fix: use self.scale instead of self.softmax_scale in Blackwell attention path

2026-05-19 10:04:46 +00:00

cutedsl_quant_method.py

Fix OOM: add --max-model-len=876544 + revert CPU dummy weight

2026-05-19 07:35:43 +00:00

nvfp4_cutedsl.py

Replace autograd.Function with torch.library.custom_op for Dynamo compat

2026-05-19 01:54:48 +00:00