biondizzle/vllm - vllm - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
Chauncey	8e5e40daf4	[Misc] Provide a DeepSeek ReasoningParser with thinking enabled by default (#33221 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2026-01-28 21:16:53 +08:00
Harry Mellor	2eb673a088	Add flake8-implicit-str-concat rules to Ruff (#33191 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2026-01-28 04:56:10 +00:00
Roger Wang	b539f988e1	[Models] Kimi-K2.5 (#33131 ) Signed-off-by: wanglinian <wanglinian@stu.pku.edu.cn> Signed-off-by: wangln19 <96399074+wangln19@users.noreply.github.com> Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> Signed-off-by: youkaichao <youkaichao@gmail.com> Signed-off-by: Roger Wang <hey@rogerw.io> Co-authored-by: wanglinian <wanglinian@stu.pku.edu.cn> Co-authored-by: wangln19 <96399074+wangln19@users.noreply.github.com> Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: Nick Hill <nickhill123@gmail.com> Co-authored-by: youkaichao <youkaichao@gmail.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2026-01-27 14:50:31 +08:00
Wentao Ye	7ef5873752	[CI] Fix mypy for `vllm/v1/structured_output` (#32722 ) Signed-off-by: yewentao256 <zhyanwentao@126.com>	2026-01-23 11:55:51 +08:00
Chauncey	707b44cc28	[Refactor] [11/N] to simplify the mcp architecture (#32396 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2026-01-15 18:49:31 +08:00
Chauncey	9312a6c03a	[Refactor] [8/N] to simplify the vLLM openai responsesapi_serving architecture (#32260 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2026-01-14 07:26:24 +00:00
Chauncey	fefce49807	[Refactor] [6/N] to simplify the vLLM openai chat_completion serving architecture (#32240 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2026-01-13 13:01:39 +00:00
Chauncey	eaba8ece77	[Bugfix]: Fix Step3ReasoningParser missing is_reasoning_end_streaming (#31969 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2026-01-08 15:28:13 +00:00
Chauncey	0202971a48	[Frontend] Support GLM-4.5 / GLM-4.7 with enable_thinking: false (#31788 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2026-01-06 13:53:21 +00:00
Yuxuan Zhang	0d4044edd8	fix no think of GLM-4.5 / GLM-4.7 (#31449 ) Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>	2026-01-04 11:43:00 +08:00
Kevin McKay	8c084de59d	[Misc] Fix spelling typos in comments (#31114 ) Signed-off-by: c0de128 <kevin.mckay@outlook.com>	2025-12-21 21:13:14 -08:00
高鑫崧	b7b6a60aca	Adapt the old parameter enable_thinking in chat_template_kwargs (#30852 ) Signed-off-by: xinsong.gao <1418762819@qq.com> Co-authored-by: Chauncey <chaunceyjiang@gmail.com>	2025-12-17 07:10:59 -08:00
Cyrus Leung	64251f48df	[Chore] Adjust tokenizer import to avoid circular imports (#30601 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-12-13 04:42:39 -08:00
Julien Denize	aa3c250c48	[IMPROVEMENT] Change MistralReasoningParser behavior (#30391 ) Signed-off-by: juliendenize <julien.denize@mistral.ai> Signed-off-by: Julien Denize <40604584+juliendenize@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2025-12-11 17:53:26 +01:00
Rei.	6299628d32	[bugfix] fix MiniMaxM2ReasoningParser streaming output not separating reasoning_content. (#29882 ) Signed-off-by: Rei <1477174254@qq.com>	2025-12-11 09:05:08 +00:00
Hubert de La Jonquiere	c72ea10723	[Structured Output][Reasoning] Improves decoding throughput for models using single-token reasoning endings. (#30056 )	2025-12-09 18:54:08 +08:00
Andrew Xia	421125d03a	[ez] move harmony utils to parser folder (#30117 ) Signed-off-by: Andrew Xia <axia@fb.com> Co-authored-by: Andrew Xia <axia@fb.com>	2025-12-06 17:34:34 -05:00
Alec S	2c174420f5	Reduce validation to a warning (#28749 ) Signed-off-by: Alec Solder <alecs@fb.com> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Alec Solder <alecs@fb.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-12-05 14:02:49 +00:00
Ning Xie	7ae13c66ba	[typing] fix type (#29964 ) Signed-off-by: Andy Xie <andy.xning@gmail.com>	2025-12-05 10:46:08 +00:00
Hubert de La Jonquiere	befb59e5b1	[Model] Add Holo2 reasoning parser (#30048 ) Signed-off-by: hdlj-h <hubert@hcompany.ai>	2025-12-05 10:38:45 +08:00
Chauncey	6796ce8bdb	[Bugfix] Fix the issue with interleaved thinking when using streaming (#30033 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com> Signed-off-by: Chauncey <chaunceyjiang@gmail.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>	2025-12-04 11:11:59 +00:00
Cyrus Leung	34a984274e	[Misc] Refactor tokenizer interface (#29693 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-11-29 04:02:21 -08:00
Harry Mellor	d9ab1ad9d1	`reasoning_content` -> `reasoning` (#27752 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-11-08 12:15:08 +00:00
Benjamin Chislett	18903216f5	[Bugfix] Fix and add tests for GptOss reasoning parser (#28000 ) Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>	2025-11-07 19:28:04 +00:00
Walter Beller-Morales	752ddeacaa	[Core] add support for reasoning parser plugins (#28075 ) Signed-off-by: walter beller-morales <walter.beller.morales@gmail.com>	2025-11-06 01:15:06 +08:00
bigmoyan	0606bea2b6	add kimi reasoning parser (#28128 ) Signed-off-by: wangzhengtao <wangzhengtao@msh.team> Co-authored-by: wangzhengtao <wangzhengtao@msh.team>	2025-11-05 21:48:33 +08:00
Chauncey	377061d481	[Misc] fix import error for DeepSeekR1ReasoningParser (#28114 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2025-11-05 19:02:32 +08:00
Chauncey	e261d37c9a	[Refactor] Lazy-loaded reasoning_parser (#28092 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2025-11-05 15:37:02 +08:00
CSWYF3634076	43a6acfb7d	[Model] fix ernie45 reasoning_parser (#27973 ) Signed-off-by: wangyafeng <wangyafeng@baidu.com>	2025-11-04 07:16:46 +00:00
Roger Young	720af6ab79	[Model][MiniMax-M2] Support MiniMax-M2 Model (#27535 ) Signed-off-by: xuebi <xuebi@minimaxi.com> Co-authored-by: xuebi <xuebi@minimaxi.com>	2025-10-27 00:59:11 +08:00
Cyrus Leung	d31f7844f8	[Misc] Move utils to avoid conflicts with stdlib, and move tests (#27169 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-19 05:20:55 -07:00
Hanchenli	7c572544e4	[GPT-OSS] Structure_Tag support for gpt-oss tool-call in cot (#25515 ) Signed-off-by: Hanchenli <lihanc2002@gmail.com> Signed-off-by: Hanchenli <61769611+Hanchenli@users.noreply.github.com> Signed-off-by: Wei Wei <wwei6@meta.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: Wei Wei <wwei6@meta.com> Co-authored-by: Wei Wei <weiweinpu@gmail.com> Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>	2025-10-17 21:55:54 -07:00
Cyrus Leung	4d4d6bad19	[Chore] Separate out `vllm.utils.importlib` (#27022 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-17 00:48:59 +00:00
Cyrus Leung	d2740fafbf	[Chore] Separate out `vllm.utils.collections` (#26990 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-16 08:35:35 +00:00
Tao Hui	85a65e7f51	[Model] Add DeepSeek-V3.1 reasoning parser (split from PR #24972 ) (#25589 ) Signed-off-by: taohui <taohui3@gmail.com> Signed-off-by: Tao Hui <taohui3@gmail.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: Chauncey <chaunceyjiang@gmail.com>	2025-10-15 11:09:52 +08:00
CSWYF3634076	782505ed8e	[Model] Add reasoning_parser and tool_parser for Ernie45 thinking (#25027 ) Signed-off-by: wangyafeng <wangyafeng@baidu.com>	2025-10-13 15:55:20 +08:00
Harry Mellor	8fcaaf6a16	Update `Optional[x]` -> `x \| None` and `Union[x, y]` to `x \| y` (#26633 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-12 09:51:31 -07:00
Chauncey	be067861c6	[Frontend] Improve the performance of `is_reasoning_end` (#25735 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2025-10-11 10:43:39 +08:00
Harry Mellor	d6953beb91	Convert formatting to use `ruff` instead of `yapf` + `isort` (#26247 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-05 07:06:22 -07:00
Luca Soldaini	d0df145c2a	Add Olmo 3 reasoning parser (#26054 ) Signed-off-by: Luca Soldaini <luca@soldaini.net>	2025-10-04 17:48:29 +08:00
Frank Wang	11aafd9886	[Bugfix] Improve GLM4 MoE Reasoning Parser's is_reasoning_end Condition (#25355 ) Signed-off-by: frankwang28 <frank.wbb@hotmail.com> Signed-off-by: Frank Wang <41319051+frankwang28@users.noreply.github.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: Chauncey <chaunceyjiang@gmail.com>	2025-09-26 11:54:00 -07:00
Tao Hui	b8d9e4a326	[Model] Add optional parameter to reasoning parser constructor (#25554 ) Signed-off-by: taohui <taohui3@gmail.com> Signed-off-by: Tao Hui <taohui3@gmail.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2025-09-26 01:12:50 +08:00
Harry Mellor	8c853050e7	[Docs] Enable `fail_on_warning` for the docs build in CI (#25580 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-24 19:30:33 +00:00
0xNullPath	be0bb568c9	[Model] Support SeedOss Reason Parser (#24263 ) Signed-off-by: Yan Lu <luyan@nvidia.com> Co-authored-by: Michael Goin <mgoin64@gmail.com>	2025-09-23 18:15:51 -06:00
Aaron Pham	c29fb540ff	[gpt-oss] tool parser supports for /chat/completions [1/n] (#22386 ) Signed-off-by: Aaron Pham <contact@aarnphm.xyz> Co-authored-by: Simon Mo <simon.mo@hey.com>	2025-09-04 20:39:12 -07:00
Didier Durand	9701352e4b	[Doc]: fix typos in Python comments (#24001 ) Signed-off-by: Didier Durand <durand.didier@gmail.com>	2025-08-31 08:21:59 +00:00
Nick Hill	f6b5040590	[Frontend] Avoid list copies in `serving_chat.py` (#22947 ) Signed-off-by: Nick Hill <nhill@redhat.com>	2025-08-16 02:06:30 +00:00
Chen Zhang	a47e6ffe93	[GptOss] Add GptOss reasoning parser to support structure output (#22322 ) Signed-off-by: Chen Zhang <zhangch99@outlook.com> Co-authored-by: LiuXiaoxuanPKU <lilyliupku@gmail.com> Co-authored-by: simon-mo <xmo@berkeley.edu> Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu> Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Co-authored-by: Minseok Lee <47620120+minseokl@users.noreply.github.com> Co-authored-by: Yongye Zhu <zyy1102000@gmail.com>	2025-08-05 23:39:13 -07:00
Song	9484641616	[Model] Add step3 vl (#21998 ) Signed-off-by: oliveryuan <yuansong@step.ai> Co-authored-by: oliveryuan <yuansong@step.ai>	2025-07-31 23:19:06 +08:00
Yuxuan Zhang	85bda9e7d0	remove GLM-4.5 quantization wrong Code (#21435 )	2025-07-24 01:52:43 -07:00

1 2

66 Commits