vllm/vllm/model_executor at 0b53bec60b9b41c9e16cefa2db367afb6a60628d - vllm

Files

Robert Shaw 5a93b9162b [MoE Refactor] Integrate Naive Prepare Finalize into MK (#32567 )

Signed-off-by: Robert Shaw <robshaw@redhat.com>
Signed-off-by: Amir Klein <203507526+amirkl94@users.noreply.github.com>
Co-authored-by: Robert Shaw <robshaw@redhat.com>
Co-authored-by: amirkl94 <203507526+amirkl94@users.noreply.github.com>

2026-01-27 01:28:02 +00:00

layers

[MoE Refactor] Integrate Naive Prepare Finalize into MK (#32567 )

2026-01-27 01:28:02 +00:00

model_loader

Add llmcompressor fp8 kv-cache quant (per-tensor and per-attn_head) (#30141 )

2026-01-22 13:29:57 -07:00

models

[Model] Bump transformers version for test registry (#33100 )

2026-01-26 18:53:22 +00:00

warmup

[MoE Refactor] Integrate Naive Prepare Finalize into MK (#32567 )

2026-01-27 01:28:02 +00:00

__init__.py

[Platform] Deprecate seed_everything (#31659 )

2026-01-04 18:34:04 -08:00

custom_op.py

[torch.compile] Compile CustomOp.forward_native for SiluAndMul and QuantFP8 to avoid raw torch ops inside opaque custom ops (#32806 )

2026-01-22 19:52:26 -08:00

parameter.py

[Docs] Replace rst style double-backtick with md single-backtick (#27091 )

2025-10-17 02:47:34 -07:00

utils.py

[Platform] Deprecate seed_everything (#31659 )

2026-01-04 18:34:04 -08:00