Bump Flashinfer Version and Re-enable DeepSeek NVFP4 AR+Norm Fusion (#34899)

Signed-off-by: wzhao18 <wzhao18.sz@gmail.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
This commit is contained in:
Wei Zhao
2026-02-20 16:37:31 -05:00
committed by GitHub
parent 0632ed8778
commit ea5f903f80
5 changed files with 6 additions and 29 deletions

View File

@@ -68,7 +68,7 @@
"default": "true"
},
"FLASHINFER_VERSION": {
"default": "0.6.3"
"default": "0.6.4"
},
"GDRCOPY_CUDA_VERSION": {
"default": "12.8"