biondizzle/vllm
Path: vllm/vllm/entrypoints/serve
Commit: 5e1a373d2e62c04ba464c88303600839d6973365
Latest commit: 06e0bc21d2 [Frontend] Split OpenAIServingModels into OpenAIModelRegistry + OpenAIServingModels (#36536)
Author: Sage Ahrac <sagiahrak@gmail.com>
Date: 2026-03-12 03:29:37 -07:00
| Name | Last commit | Date |
| --- | --- | --- |
| cache/ | Support clear mm and encoder cache (#33452) | 2026-01-31 15:22:25 +00:00 |
| disagg/ | [openapi server] log exception in exception handler(2/N) (#36201) | 2026-03-10 20:16:30 -07:00 |
| elastic_ep/ | [Refactor] [10/N] to simplify the vLLM openai completion serving architecture (#32369) | 2026-01-15 07:41:34 +00:00 |
| instrumentator/ | [Frontend] Add GPU-less render serving path (vllm launch render) (#36166) | 2026-03-08 16:35:09 +01:00 |
| lora/ | [Feat] allow inplace loading lora (#31326) | 2026-01-20 10:15:20 +08:00 |
| profile/ | [Cleanup] Refactor profiling env vars into a CLI config (#29912) | 2025-12-09 13:29:33 -05:00 |
| render/ | [Frontend] Split OpenAIServingModels into OpenAIModelRegistry + OpenAIServingModels (#36536) | 2026-03-12 03:29:37 -07:00 |
| rlhf/ | [Feat][RL] Pause and Resume with keep requests for single engine (#32351) | 2026-02-07 00:08:58 +00:00 |
| rpc/ | [Refactor] [4/N] Move VLLM_SERVER_DEV endpoints into the serve directory (#30749) | 2025-12-17 02:27:30 -08:00 |
| sleep/ | [Core] Cleanup engine pause/sleep logic (#34528) | 2026-02-24 19:33:34 -08:00 |
| tokenize/ | feat: expose media_io_kwargs at runtime (#34778) | 2026-03-07 04:27:04 +00:00 |
| __init__.py | [Frontend][CI] Consolidate instrumentator entrypoints (#34123) | 2026-02-10 07:30:19 +00:00 |