biondizzle/vllm - vllm - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
Micah Williamson	f5432e35a3	[ROCm][CI] Loosen RemoteOpenAIServer Startup Timeout (#34922 ) Signed-off-by: Micah Williamson <micah.williamson@amd.com>	2026-02-20 05:37:49 +00:00
Almog Tavor	72d5951d02	[Bugfix] Treat generation_config max_tokens as default not ceiling (#34063 ) Signed-off-by: almogtavor <almogtavor@gmail.com>	2026-02-16 07:58:24 -08:00
Cyrus Leung	73391a1baa	[Renderer] Move InputPreprocessor into Renderer (1/2) (#34510 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Cyrus Leung <cyrus.tl.leung@gmail.com>	2026-02-14 10:14:21 -08:00
Cyrus Leung	2f308214c0	[Refactor] Pass full VllmConfig to Renderer (#34485 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2026-02-12 22:48:38 -08:00
Cyrus Leung	fb455ed547	[V0 Deprecation] Remove code related to per-request logits processors (#34400 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2026-02-12 20:44:28 +08:00
Cyrus Leung	cd8b405bd0	[Refactor] Consolidate sequence normalization and enc-dec parsing (#33928 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2026-02-06 15:43:47 +00:00
Andrew Xia	e1bf04b6c2	[1/N] Initial Implementation of Parser for ResponsesAPI (#32712 ) Signed-off-by: Andrew Xia <axia@fb.com> Co-authored-by: Andrew Xia <axia@fb.com>	2026-02-04 10:59:03 +08:00
Cyrus Leung	f0a1c8453a	[Frontend] Use new Renderer for Completions and Tokenize API (#32863 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2026-01-31 04:51:15 -08:00
wangln19	2d7053438a	fix: preserve native tool call ID in multi-turn tool calling (#32768 ) Signed-off-by: wanglinian <wanglinian@stu.pku.edu.cn> Signed-off-by: wangln19 <96399074+wangln19@users.noreply.github.com> Signed-off-by: Roger Wang <hey@rogerw.io> Co-authored-by: Roger Wang <hey@rogerw.io> Co-authored-by: Isotr0py <2037008807@qq.com>	2026-01-27 10:22:35 +08:00
Cyrus Leung	d117a4d1a9	[Frontend] Introduce Renderer for processing chat messages (using `ModelConfig`) (#30200 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2026-01-22 12:44:22 +00:00
vanshil shah	037a6487af	apply _validate_input to MistralTokenizer token-id chat prompts (#32448 ) Signed-off-by: Vanshil Shah <vanshilshah@gmail.com>	2026-01-17 03:23:45 +00:00
Micah Williamson	46f8a982b1	[ROCm][CI] Enable AITER Unified Attention On ROCm For gpt-oss Test (#32431 ) Signed-off-by: Micah Williamson <micah.williamson@amd.com>	2026-01-16 00:55:57 +00:00
Chauncey	4c1c501a7e	[Refactor] [10/N] to simplify the vLLM openai completion serving architecture (#32369 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2026-01-15 07:41:34 +00:00
Aleksandr Samarin	d084e9fca7	[MODEL] Fix handling of multiple channels for gpt-oss with speculative decoding (#26291 ) Signed-off-by: Aleksandr Samarin <astrlrd@nebius.com> Signed-off-by: southfreebird <yvorott@gmail.com> Co-authored-by: southfreebird <yvorott@gmail.com>	2026-01-14 13:20:52 -05:00
Chauncey	fefce49807	[Refactor] [6/N] to simplify the vLLM openai chat_completion serving architecture (#32240 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2026-01-13 13:01:39 +00:00
amittell	9c884faa95	[Bugfix] Preserve tool call id/type/name in streaming finish chunk (#31438 ) Signed-off-by: amittell <mittell@me.com> Signed-off-by: Alex Mittell <mittell@me.com>	2025-12-29 21:10:52 +08:00
汪志鹏	3e92b2b7ac	[BugFix]fix gpt-oss v1/completions response bug (#30608 ) Signed-off-by: princepride <wangzhipeng628@gmail.com> Signed-off-by: 汪志鹏 <wangzhipeng628@gmail.com> Co-authored-by: Chauncey <chaunceyjiang@gmail.com> Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk> Co-authored-by: bbrowning <bbrownin@redhat.com>	2025-12-21 10:39:31 +08:00
lif	086b96339f	[Bugfix] Add validation for tool requests when tool_parser is unavailable (#30613 ) Signed-off-by: majiayu000 <1835304752@qq.com> Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-19 18:23:28 +08:00
PlatinumGod	6a09612b2e	[Bugfix] Fix tool_choice="none" being ignored by GPT-OSS/harmony models (#30867 ) Signed-off-by: yujiepu <pyjapple@gmail.com> Signed-off-by: PlatinumGod <pyjapple@gmail.com> Co-authored-by: Chauncey <chaunceyjiang@gmail.com>	2025-12-19 09:34:27 +08:00
inkcherry	500f26e6d3	[Bugfix] fix DP-aware routing in OpenAI API requests (#29002 ) Signed-off-by: inkcherry <mingzhi.liu@amd.com>	2025-12-18 09:50:42 -08:00
Matthew Bonanni	7eb6cb6c18	[Attention] Update tests to remove deprecated env vars (#30563 ) Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>	2025-12-17 09:49:59 -08:00
Chauncey	2a1776b7ac	[Refactor] [2/N] Move tool parsers into the vLLM main directory (#30675 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2025-12-15 12:54:52 +00:00
Cyrus Leung	dcb31196da	[Chore] Remove redundant `RequestPrompt` (#30612 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-12-14 09:22:37 +00:00
Ben Browning	8f8fda261a	[Bugfix] Multiple fixes for gpt-oss Chat Completion prompting (#28729 ) Signed-off-by: Ben Browning <bbrownin@redhat.com> Co-authored-by: Chauncey <chaunceyjiang@gmail.com>	2025-12-12 12:59:53 +08:00
Cyrus Leung	e83b7e379c	Revert "[Renderer] Separate out `RendererConfig` from `ModelConfig` (#30145 )" (#30199 )	2025-12-07 00:00:22 -08:00
Cyrus Leung	27f4c2fd46	[Renderer] Separate out `RendererConfig` from `ModelConfig` (#30145 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-12-06 23:15:42 -08:00
Cyrus Leung	653591d5e7	[Chore] Move tokenizer initialization methods (#29793 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-12-02 13:33:37 +08:00
Cyrus Leung	b2c50eda50	[Bugfix] Fix wrong mock attribute (#29704 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-11-29 10:30:41 +08:00
Cyrus Leung	8d9338fae4	[Chore] Rename `Processor` to `InputProcessor` (#29682 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-11-28 09:35:41 -08:00
Isotr0py	3f5a4b6473	[Bugfix] Validate custom logits processor xargs for online serving (#27560 ) Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>	2025-11-05 16:53:33 +00:00
cong-meta	a2981c4272	[EP/DP][API Server] Enable DP-aware routing in OpenAI API requests (#24945 ) Co-authored-by: Cong Chen <prowindy@gmail.com>	2025-10-30 12:10:16 -07:00
Harry Mellor	8fcaaf6a16	Update `Optional[x]` -> `x \| None` and `Union[x, y]` to `x \| y` (#26633 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-12 09:51:31 -07:00
Chauncey	720d3cd0f0	[CI] fix ruff format (#26579 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2025-10-10 03:02:12 -07:00
Luis Tomas Bolivar	3ee202ea1e	[GPT-OSS] Add support for arrays at tool message content (#25593 ) Signed-off-by: Luis Tomas Bolivar <ltomasbo@redhat.com>	2025-10-10 09:00:45 +00:00
Cyrus Leung	4bdf7ac593	[Bugfix] Fix SHM cache initialization (#26427 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-09 02:48:04 -07:00
Harry Mellor	1c0c68202c	Fix per file ruff ignores related to typing (#26254 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-05 16:37:55 +00:00
Harry Mellor	d6953beb91	Convert formatting to use `ruff` instead of `yapf` + `isort` (#26247 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-05 07:06:22 -07:00
Yang Liu	812b7f54a8	[Renderer] Move Processor out of AsyncLLM (#24138 ) Signed-off-by: Yang <lymailforjob@gmail.com> Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-03 11:29:45 +00:00
Russell Bryant	3958b96bf5	Add option to restrict media domains (#25783 ) Signed-off-by: Chenheli Hua <huachenheli@outlook.com> Signed-off-by: Russell Bryant <rbryant@redhat.com> Co-authored-by: Chenheli Hua <huachenheli@outlook.com>	2025-09-27 01:23:52 +00:00
Matthew Bonanni	3468f17ebe	[V0 deprecation] Remove _VLLM_V1 suffixes from attention backend names (#25489 ) Signed-off-by: Matthew Bonanni <mbonanni@redhat.com> Signed-off-by: Matthew Bonanni <mbonanni001@gmail.com>	2025-09-25 17:37:50 +00:00
Ben Browning	5caaeb714c	[Bugfix] [Frontend] Cleanup gpt-oss non-streaming chat tool calls (#25514 ) Signed-off-by: Ben Browning <bbrownin@redhat.com>	2025-09-24 03:20:38 +00:00
Aaron Pham	29283e8976	[Chore] Cleanup guided namespace, move to structured outputs config (#22772 ) Signed-off-by: Aaron Pham <contact@aarnphm.xyz> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-18 09:20:27 +00:00
Woosuk Kwon	5801e49776	[V0 Deprecation] Remove MQLLMEngine (#25019 ) Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu> Signed-off-by: Woosuk Kwon <woosuk@thinkingmachines.ai>	2025-09-16 21:29:27 -07:00
Harry Mellor	c4afdb69cc	Move `MultiModalConfig` from `config/__init__.py` to `config/multimodal.py` (#24659 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2025-09-15 17:43:16 +00:00
Harry Mellor	c1eda615ba	Fix model name included in responses (#24663 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-11 10:47:51 -07:00
lacora	0b9a612fa3	[BugFix][easy] Fix flaky test test_gpt_oss_multi_turn_chat (#24549 ) Signed-off-by: lacora2017 <yehu@meta.com> Co-authored-by: lacora2017 <yehu@meta.com>	2025-09-10 21:14:55 +08:00
Aaron Pham	fb691ee4e7	[Fix] [gpt-oss] fix non-tool calling path for chat completion (#24324 )	2025-09-06 19:10:32 +00:00
Aaron Pham	c29fb540ff	[gpt-oss] tool parser supports for /chat/completions [1/n] (#22386 ) Signed-off-by: Aaron Pham <contact@aarnphm.xyz> Co-authored-by: Simon Mo <simon.mo@hey.com>	2025-09-04 20:39:12 -07:00
Didier Durand	d7e1e59972	[Doc]: fix typos in Python comments (#24093 ) Signed-off-by: Didier Durand <durand.didier@gmail.com>	2025-09-02 21:05:45 -07:00
Marko Rosenmueller	80141bbf2f	fix: use cache_salt for gpt-oss (#23186 ) Signed-off-by: Marko Rosenmueller <5467316+dr75@users.noreply.github.com>	2025-08-19 18:12:25 +00:00

1 2

80 Commits