vllm/vllm/model_executor at a8c6ee9b787d273916206a29b77feebadb80c368 - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Richard Zou 6c749399b7 [BugFix] fix tests/kernels/moe/test_moe_layer.py (#39404 )

Signed-off-by: Richard Zou <zou3519@gmail.com>

2026-04-09 08:48:59 -04:00

..

[W8A8 Block Linear Refactor][2/N] Remove W8A8Fp8BlockLinearOp and adopt Fp8 block linear kernel selections. (#33892 )

2026-04-09 08:50:39 +08:00

[BugFix] fix tests/kernels/moe/test_moe_layer.py (#39404 )

2026-04-09 08:48:59 -04:00

[Frontend] new online quantization frontend (#38138 )

2026-04-03 11:58:39 -04:00

nemotron-nano-vl: Allow use_audio_in_video to be passed at vllm serve time (#38538 )

2026-04-09 11:44:39 +00:00

Bugfix for offloading+prefetch for GLM-4.7-FP8 (#37178 )

2026-03-17 21:22:09 +08:00

[MoE] Move DEEP_GEMM into experts/ subdirectory (#39005 )

2026-04-08 19:23:08 +00:00

__init__.py

[Platform] Deprecate seed_everything (#31659 )

2026-01-04 18:34:04 -08:00

custom_op.py

Add ability to replace oot ops when using lora (#37181 )

2026-03-16 18:04:15 -07:00

parameter.py

[Mypy] Fix mypy for vllm/model_executor (except vllm/model_executor/layers) (#37904 )

2026-03-24 17:14:01 +00:00

utils.py

[BugFix] Fix EPLB fail for MoeFP4 model with Marlin backend (#33262 )

2026-01-29 16:52:11 +08:00