biondizzle/vllm
c46b0cd0af778e5678c52347c5b4957c1e8a0715
vllm/vllm/entrypoints/serve
Cyrus Leung d117a4d1a9 [Frontend] Introduce Renderer for processing chat messages (using ModelConfig) (#30200)
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
2026-01-22 12:44:22 +00:00
Name | Last commit | Date
cache | [Refactor] [4/N] Move VLLM_SERVER_DEV endpoints into the serve directory (#30749) | 2025-12-17 02:27:30 -08:00
disagg | [Refactor] [10/N] to simplify the vLLM openai completion serving architecture (#32369) | 2026-01-15 07:41:34 +00:00
elastic_ep | [Refactor] [10/N] to simplify the vLLM openai completion serving architecture (#32369) | 2026-01-15 07:41:34 +00:00
instrumentator | UX: add vLLM env info in '/server_info' (#31899) | 2026-01-07 17:13:02 +00:00
lora | [Feat] allow inplace loading lora (#31326) | 2026-01-20 10:15:20 +08:00
profile | [Cleanup] Refactor profiling env vars into a CLI config (#29912) | 2025-12-09 13:29:33 -05:00
rlhf | [Refactor] [1/N] to simplify the vLLM serving architecture (#28040) | 2025-12-03 01:26:39 -08:00
rpc | [Refactor] [4/N] Move VLLM_SERVER_DEV endpoints into the serve directory (#30749) | 2025-12-17 02:27:30 -08:00
sleep | [Refactor] [4/N] Move VLLM_SERVER_DEV endpoints into the serve directory (#30749) | 2025-12-17 02:27:30 -08:00
tokenize | [Frontend] Introduce Renderer for processing chat messages (using ModelConfig) (#30200) | 2026-01-22 12:44:22 +00:00
__init__.py | [Feature] Add offline FastAPI documentation support for air-gapped environments (#30184) | 2025-12-29 16:22:39 +00:00