biondizzle/vllm

Author	SHA1	Message	Date
Yongye Zhu	e8ebbdde83	[Quantization] Add FlashInfer CuteDSL batched experts backend for NVFP4 MoE (#38251) Signed-off-by: Yongye Zhu <zyy1102000@gmail.com> Co-authored-by: Michael Goin <mgoin64@gmail.com> Co-authored-by: Roger Wang <hey@rogerw.io>	2026-04-06 11:57:53 -07:00
Zhewen Li	be1a85b7a2	Revert "[MoE Kernel] Flashinfer nvfp4 cutedsl moe kernel integration" (#38050) (#38169) Co-authored-by: Zhewen Li <zhewenli@inferact.ai>	2026-03-26 07:59:09 -07:00
Yongye Zhu	678b3c99e8	[MoE Kernel] Flashinfer nvfp4 cutedsl moe kernel integration (#38050)	2026-03-25 10:16:40 -07:00
Robert Shaw	6b2fa3a762	[MoE] Move FlashInfer CuteDSL experts into fused_moe/experts/ (#37759) Signed-off-by: Robert Shaw <robertgshaw2@gmail.com>	2026-03-21 19:15:16 -04:00
lukec	15a0b9e570	Fix spelling errors (#33978)	2026-02-06 23:58:50 -08:00
Shu Wang	613abb50d5	[MoE] Nvfp4 Masked Gemm: Add flashinfer grouped_gemm_nt_masked (#25990) Signed-off-by: Shu Wang. <shuw@nvidia.com> Signed-off-by: mgoin <mgoin64@gmail.com> Co-authored-by: Michael Goin <mgoin64@gmail.com>	2025-11-19 13:29:06 -08:00