support flashinfer_fp4 moe for 5090 gpu (#26669)

Signed-off-by: XiaobingSuper <xiaobingzhangupc@gmail.com>
Signed-off-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
This commit is contained in:
XiaobingZhang
2025-10-16 03:06:47 +08:00
committed by GitHub
parent 1f491aa0c8
commit 0b99f5d302

View File

@@ -32,7 +32,7 @@ def is_flashinfer_fp4_cutlass_moe_available() -> bool:
envs.VLLM_USE_FLASHINFER_MOE_FP4
and has_flashinfer_cutlass_fused_moe()
and current_platform.is_cuda()
and current_platform.is_device_capability(100)
and current_platform.has_device_capability(100)
)