vllm/vllm/model_executor at fa6a6be51978bd4b49ba0da17039e60f96dc5b13 - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Ye (Charlotte) Qi fa6a6be519 [Bugfix] Fix missing sequence_lengths in qwen3_omni_moe_thinker (#35741 )

Signed-off-by: Ye (Charlotte) Qi <yeq@meta.com>

2026-03-02 21:11:56 +00:00

..

[CPU][Feat] Enable KleidiAI INT8_W4A8 for all input dtypes (#34890 )

2026-02-26 05:00:10 +00:00

[Core] Move test utility to test file (#35672 )

2026-03-02 10:56:03 -05:00

add mixed precision support for modelopt (#35047 )

2026-02-26 21:56:24 +00:00

[Bugfix] Fix missing sequence_lengths in qwen3_omni_moe_thinker (#35741 )

2026-03-02 21:11:56 +00:00

[offloader] v2: Hide weight onloading latency via prefetching (#29941 )

2026-02-25 17:20:59 -08:00

[Platform] Add current_platform.num_compute_units interface (#35042 )

2026-02-24 22:22:49 -08:00

__init__.py

[Platform] Deprecate seed_everything (#31659 )

2026-01-04 18:34:04 -08:00

custom_op.py

[Model Bash][DSR1] Add selective dynamic shape marking for CustomOp (#34900 )

2026-02-21 19:28:01 -05:00

parameter.py

[QeRL] Layerwise Reloading (#32133 )

2026-01-30 08:50:05 -07:00

utils.py

[BugFix] Fix EPLB fail for MoeFP4 model with Marlin backend (#33262 )

2026-01-29 16:52:11 +08:00