vllm/vllm/model_executor at 56e96b37e4951946c06379b4891d8170e743dcc2 - vllm

Files

Cyrus Leung 0e741c12e3 [Bugfix] Fix Plamo3 rope handling (#29092 )

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

2025-11-21 11:38:35 +08:00

layers

[ROCm] Fix for import when building with upstream triton for gfx1100 for gpt-oss serving (#29127 )

2025-11-21 03:30:07 +00:00

model_loader

[torchao] fix safetensors for sharding (#28169 )

2025-11-19 16:39:45 -08:00

models

[Bugfix] Fix Plamo3 rope handling (#29092 )

2025-11-21 11:38:35 +08:00

warmup

[Core] Encoder separation for Encode-Prefill-Decode Disaggregation (#25233 )

2025-11-11 18:58:33 -08:00

__init__.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

custom_op.py

[FrontEnd] UNREVERT CompilationConfig overhaul (#20283 ): deprecate use_inductor in favor of backend, simplify custom_ops (#26502 )

2025-10-13 22:47:16 +00:00

parameter.py

[Docs] Replace rst style double-backtick with md single-backtick (#27091 )

2025-10-17 02:47:34 -07:00

utils.py

[CI] Fix mypy for vllm/v1/worker (#29037 )

2025-11-21 11:36:07 +08:00