Logo
Explore Help
Register Sign In
biondizzle/nvfp4-megamoe-kernel
1
0
Fork 0
You've already forked nvfp4-megamoe-kernel
Code Issues Pull Requests Actions Packages Projects Releases Wiki Activity
Files
fece06f746ce2198fc4f3fcdaa550ed196a052bb
nvfp4-megamoe-kernel/vllm
History
biondizzle b0b5113467 Fix weight mapper: compressor → attn.compressor (not mla_attn), quant weights_proj
- The compressor is on attn.compressor (not attn.mla_attn.compressor)
- weights_proj in indexer is NVFP4-quantized in our checkpoint
2026-05-19 03:20:41 +00:00
..
kernels/linear/nvfp4
Replace autograd.Function with torch.library.custom_op for Dynamo compat
2026-05-19 01:54:48 +00:00
patches
Fix weight mapper: compressor → attn.compressor (not mla_attn), quant weights_proj
2026-05-19 03:20:41 +00:00
cutedsl_quant_method.py
Replace autograd.Function with torch.library.custom_op for Dynamo compat
2026-05-19 01:54:48 +00:00
nvfp4_cutedsl.py
Replace autograd.Function with torch.library.custom_op for Dynamo compat
2026-05-19 01:54:48 +00:00
Powered by Gitea Version: 1.25.2 Page: 27ms Template: 4ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API