biondizzle/vllm - vllm - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
Jerry Zhang	c8134bea15	Fix AOPerModuleConfig name changes (#18869 ) Signed-off-by: Jerry Zhang <jerryzh168@gmail.com>	2025-06-05 18:51:32 -07:00
Woosuk Kwon	b124e1085b	[Bugfix] Fix FA3 full cuda graph correctness (#19106 ) Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>	2025-06-03 23:10:15 -07:00
Yan Ru Pei	b712be98c7	feat: add data parallel rank to KVEventBatch (#18925 )	2025-06-03 17:14:20 -07:00
Concurrensee	4ce42f9204	Adding "LoRA Test %N" to AMD production tests (#18929 ) Signed-off-by: Yida Wu <yidawu@alumni.cmu.edu>	2025-06-02 20:46:44 -07:00
Nick Hill	2dbe8c0774	[Perf] API-server scaleout with many-to-many server-engine comms (#17546 )	2025-05-30 08:17:00 -07:00
Rabi Mishra	5f1d0c8118	[Bugfix][Failing Test] Fix test_vllm_port.py (#18618 ) Signed-off-by: rabi <ramishra@redhat.com>	2025-05-30 17:13:47 +08:00
Rabi Mishra	b78f844a67	[Bugfix][FailingTest]Fix test_model_load_with_params.py (#18758 ) Signed-off-by: rabi <ramishra@redhat.com>	2025-05-28 05:42:54 +00:00
Mark McLoughlin	06a0338015	[V1][Metrics] Add API for accessing in-memory Prometheus metrics (#17010 ) Signed-off-by: Mark McLoughlin <markmc@redhat.com>	2025-05-27 09:37:06 +00:00
Cyrus Leung	82e2339b06	[Doc] Move examples and further reorganize user guide (#18666 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-05-26 07:38:04 -07:00
Isotr0py	0877750029	[CI/Build] Split pooling and generation extended language models tests in CI (#18705 ) Signed-off-by: Isotr0py <2037008807@qq.com>	2025-05-26 04:00:08 -07:00
Michael Goin	0ddf88e16e	[CI] Enable test_initialization to run on V1 (#16736 ) Signed-off-by: mgoin <mgoin64@gmail.com>	2025-05-23 15:09:44 -07:00
Cyrus Leung	6dd51c7ef1	[CI/Build] Fix V1 flag being set in entrypoints tests (#18598 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-05-23 05:51:53 -07:00
Harry Mellor	a1fe24d961	Migrate docs from Sphinx to MkDocs (#18145 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-05-23 02:09:53 -07:00
cascade	71ea614d4a	[Feature]Add async tensor parallelism using compilation pass (#17882 ) Signed-off-by: cascade812 <cascade812@outlook.com>	2025-05-23 01:03:34 -07:00
Sanger Steel	c32e249a23	[Frontend] [Core] Add Tensorizer support for V1, LoRA adapter serialization and deserialization (#17926 ) Signed-off-by: Sanger Steel <sangersteel@gmail.com>	2025-05-22 18:44:18 -07:00
David Xia	1f3a1200e4	[Bugfix] make `test_openai_schema.py` pass (#18224 ) Signed-off-by: David Xia <david@davidxia.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-05-22 18:34:06 +00:00
lkchen	a35a494745	[Bugfix] Add kwargs to RequestOutput __init__ to be forward compatible (#18513 ) Signed-off-by: Linkun <github@lkchen.net>	2025-05-22 05:24:43 -07:00
Rabi Mishra	61acfc45bc	[Bugfix][Failing Test] Fix test_events.py (#18460 ) Signed-off-by: rabi <ramishra@redhat.com>	2025-05-21 04:57:28 -07:00
Lucia Fang	3d2779c29a	[Feature] Support Pipeline Parallism in torchrun SPMD offline inference for V1 (#17827 ) Signed-off-by: Lucia Fang <fanglu@fb.com>	2025-05-15 22:28:27 -07:00
Alexei-V-Ivanov-AMD	0b34593017	Adding "AMD: Tensorizer Test" to amdproduction. (#18216 )	2025-05-15 11:01:25 -07:00
Alexei-V-Ivanov-AMD	566ec04c3d	Adding "Basic Models Test" and "Multi-Modal Models Test (Extended) 3" in AMD Pipeline (#18106 ) Signed-off-by: Alexei V. Ivanov <alexei.ivanov@amd.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>	2025-05-15 08:49:23 -07:00
Mark McLoughlin	65334ef3b9	[V1][Metrics] Remove unused code (#18158 ) Signed-off-by: Mark McLoughlin <markmc@redhat.com>	2025-05-14 20:13:17 -07:00
Charlie Fu	7b2f28deba	[AMD][torch.compile] Enable silu+fp8_quant fusion for rocm (#18082 ) Signed-off-by: charlifu <charlifu@amd.com>	2025-05-13 22:13:56 -07:00
Nick Hill	ee5be834e7	[BugFix] Fix 4-GPU RLHF tests (#18007 ) Signed-off-by: Nick Hill <nhill@redhat.com>	2025-05-12 23:03:55 -07:00
Yang Wang	2b0db9b0e2	Enable standard language model for torhc nightly (#18004 ) Signed-off-by: Yang Wang <elainewy@meta.com>	2025-05-12 14:00:04 -07:00
Alexei-V-Ivanov-AMD	e9c730c9bd	Enabling "Weight Loading Multiple GPU Test - Large Models" (#18020 )	2025-05-12 13:05:33 -07:00
Jonathan Berkhahn	98ea35601c	[Lora][Frontend]Add default local directory LoRA resolver plugin. (#16855 ) Signed-off-by: jberkhahn <jaberkha@us.ibm.com>	2025-05-12 10:39:10 -07:00
Robert Shaw	d19110204c	[P/D] NIXL Integration (#17751 ) Signed-off-by: ApostaC <yihua98@uchicago.edu> Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com> Signed-off-by: rshaw@neuralmagic.com <robertgshaw2@gmail.com> Signed-off-by: Robert Shaw <rshaw@neuralmagic.com> Signed-off-by: mgoin <mgoin64@gmail.com> Signed-off-by: Nick Hill <nhill@redhat.com> Signed-off-by: Brent Salisbury <bsalisbu@redhat.com> Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com> Co-authored-by: ApostaC <yihua98@uchicago.edu> Co-authored-by: Robert Shaw <rshaw@neuralmagic.com> Co-authored-by: mgoin <mgoin64@gmail.com> Co-authored-by: Nick Hill <nhill@redhat.com> Co-authored-by: Tyler Michael Smith <tysmith@redhat.com> Co-authored-by: Brent Salisbury <bsalisbu@redhat.com>	2025-05-12 09:46:16 -07:00
Alexei-V-Ivanov-AMD	3b602cdea7	AMD conditional all test execution // new test groups (#17556 ) Signed-off-by: Alexei V. Ivanov <alexei.ivanov@amd.com> Signed-off-by: Yida Wu <yidawu@alumni.cmu.edu>	2025-05-09 15:35:58 -07:00
Michael Goin	950b71186f	Replace lm-eval bash script with pytest and use enforce_eager for faster CI (#17717 ) Signed-off-by: mgoin <mgoin64@gmail.com>	2025-05-06 18:00:10 -07:00
Harry Mellor	d6484ef3c3	Add full API docs and improve the UX of navigating them (#17485 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-05-03 19:42:43 -07:00
Yang Wang	b8b0859b5c	add more pytorch related tests for torch nightly (#17422 ) Signed-off-by: Yang Wang <elainewy@meta.com>	2025-05-02 03:29:59 -07:00
Cyrus Leung	48e925fab5	[Misc] Clean up test docstrings and names (#17521 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-05-01 05:19:32 -07:00
Cyrus Leung	afb4429b4f	[CI/Build] Reorganize models tests (#17459 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-30 23:03:08 -07:00
Huy Do	2c4f59afc3	Update PyTorch to 2.7.0 (#16859 )	2025-04-29 19:08:04 -07:00
Alexei-V-Ivanov-AMD	608968b7c5	Enabling multi-group kernel tests. (#17115 ) Signed-off-by: Alexei V. Ivanov <alexei.ivanov@amd.com>	2025-04-29 10:27:27 -07:00
cascade	690fe019f0	[Feature] support sequence parallelism using compilation pass (#16155 ) Signed-off-by: cascade812 <cascade812@outlook.com> Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com> Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com>	2025-04-27 06:29:35 -07:00
Michael Goin	6317a5174a	Categorize `tests/kernels/` based on kernel type (#16799 ) Signed-off-by: mgoin <mgoin64@gmail.com>	2025-04-23 09:21:07 -04:00
Russell Bryant	ce17db8085	[CI] Run v1/test_serial_utils.py in CI (#16996 ) Signed-off-by: Russell Bryant <rbryant@redhat.com>	2025-04-23 01:13:34 -07:00
Yang Wang	f67e9e9f22	add Dockerfile build vllm against torch nightly (#16936 ) Signed-off-by: Yang Wang <elainewy@meta.com>	2025-04-22 19:08:27 -07:00
Alexei-V-Ivanov-AMD	5536b30a4c	Fencing Kernels Tests for enabling on AMD (#16929 ) Signed-off-by: Alexei V. Ivanov <alexei.ivanov@amd.com>	2025-04-22 09:32:40 -07:00
Woosuk Kwon	4c41278b77	[CI/CD][V1] Add spec decode tests to CI (#16900 ) Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>	2025-04-20 22:37:16 -07:00
Tarun Kumar	e37073efd7	Add property-based testing for vLLM endpoints using an API defined by an OpenAPI 3.1 schema (#16721 ) Signed-off-by: Tarun Kumar <takumar@redhat.com> Signed-off-by: Nick Hill <nhill@redhat.com> Co-authored-by: Nick Hill <nhill@redhat.com>	2025-04-17 21:08:27 -07:00
Robert Shaw	2b05b8ce69	[V1][Frontend] Improve Shutdown And Logs (#11737 ) Signed-off-by: rshaw@neuralmagic.com <rshaw@neuralmagic.com> Signed-off-by: Andrew Feldman <afeldman@neuralmagic.com> Signed-off-by: Nick Hill <nhill@redhat.com> Co-authored-by: rshaw@neuralmagic.com <rshaw@neuralmagic.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by: Russell Bryant <rbryant@redhat.com> Co-authored-by: Andrew Feldman <afeldman@neuralmagic.com> Co-authored-by: afeldman-nm <156691304+afeldman-nm@users.noreply.github.com> Co-authored-by: Nick Hill <nhill@redhat.com>	2025-04-16 19:48:34 -07:00
Shinichi Hemmi	3badb0213b	[Model] Add PLaMo2 (#14323 ) Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com> Signed-off-by: shemmi <shemmi@preferred.jp> Co-authored-by: Kento Nozawa <nzw0301@preferred.jp> Co-authored-by: Hiroaki Mikami <mhiroaki@preferred.jp> Co-authored-by: Calvin Metzger <metzger@preferred.jp>	2025-04-15 19:31:30 -07:00
Michael Goin	b4fe16c75b	Add `vllm bench [latency, throughput]` CLI commands (#16508 ) Signed-off-by: mgoin <mgoin64@gmail.com>	2025-04-14 23:10:35 -07:00
Michael Goin	57504a4bcf	[CI][Bugfix] Add mistral_tool_use to Ci (#16517 ) Signed-off-by: mgoin <mgoin64@gmail.com>	2025-04-11 17:52:38 -07:00
Cyrus Leung	3d4c87758e	[Misc] Update transformers version limits of multi-modal tests (#16381 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-09 23:03:33 -07:00
Russell Bryant	fee5b8d37f	[Build/CI] Add tracing deps to vllm container image (#15224 ) Signed-off-by: Russell Bryant <rbryant@redhat.com>	2025-04-09 19:14:06 +00:00
Luka Govedič	9cdde47289	[BugFix] Fix fusion test and add them to CI (#16287 ) Signed-off-by: luka <luka@neuralmagic.com>	2025-04-08 23:46:45 -07:00

1 2 3 4 5 ...

293 Commits