biondizzle/vllm - vllm - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
Cyrus Leung	b7030d962b	[Benchmark] Enable benchmark to run with `encoding_format="bytes"` (#27467 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-24 11:16:50 +00:00
Cyrus Leung	17838e50ef	[Benchmark] Use truncation by default for pooling benchmarks (#26992 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-16 16:02:39 +08:00
Maximilien de Bayser	fe3edb4cf0	Add support for the /rerank endpoint in vllm bench serve (#26602 ) Signed-off-by: Max de Bayser <mbayser@br.ibm.com>	2025-10-14 04:25:43 +00:00
Harry Mellor	8fcaaf6a16	Update `Optional[x]` -> `x \| None` and `Union[x, y]` to `x \| y` (#26633 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-12 09:51:31 -07:00
Cyrus Leung	5be7ca1b99	[Benchmark] Support Infinity API (#26641 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-12 01:45:32 +08:00
Cyrus Leung	44b9af5bb2	[Benchmark] Enable MM Embedding benchmarks (#26310 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-06 19:51:58 +00:00
Harry Mellor	d6953beb91	Convert formatting to use `ruff` instead of `yapf` + `isort` (#26247 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-05 07:06:22 -07:00
samzong	ce75e15373	refactor(benchmarks): add type annotations to wait_for_endpoint parameters (#25218 ) Signed-off-by: samzong <samzong.lu@gmail.com>	2025-09-19 16:36:52 +00:00
Simon Mo	a904ea78ea	[benchmark] add peak throughput metrics and plot (#23867 ) Signed-off-by: simon-mo <simon.mo@hey.com>	2025-09-17 22:30:02 -07:00
Clayton Coleman	bc636f21a6	[Benchmark] Allow arbitrary headers to be passed to benchmarked endpoints (#23937 ) Signed-off-by: Clayton Coleman <smarterclayton@gmail.com>	2025-09-12 13:57:53 -07:00
Ming Yang	1823a00d67	[Misc] Support bench serve long context (#24373 ) Signed-off-by: Ming Yang <minos.future@gmail.com>	2025-09-08 22:53:10 -07:00
Jiangyun Zhu	3ecbb14b81	[Benchmarks] add benchmark for embedding models (#23000 ) Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>	2025-08-25 23:57:08 -07:00
Jared O'Connell	31282401b6	[BugFix] Fix Python 3.9 Support (#23306 ) Signed-off-by: Jared O'Connell <46976761+jaredoconnell@users.noreply.github.com> Signed-off-by: Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>	2025-08-20 23:23:56 -07:00
hustxiayang	31436e8b4f	[Misc] Add request_id into benchmark_serve.py (#23065 ) Signed-off-by: yangxia <yangxiast@gmail.com>	2025-08-19 08:32:18 +00:00
Breno Baldas Skuk	65a7917be4	Fix(benchmarks): allow multiple mm contents in OpenAI Chat Completion Benchmarks (#22534 ) Signed-off-by: breno.skuk <breno.skuk@hcompany.ai>	2025-08-10 09:03:15 -07:00
Seiji Eicher	6f5478298d	Use `aiohttp` connection pool for benchmarking (#21981 ) Signed-off-by: Seiji Eicher <seiji@anyscale.com>	2025-08-03 19:23:32 -07:00
Ye (Charlotte) Qi	3f36c325fa	[Benchmark] Support ready check timeout in `vllm bench serve` (#21696 ) Signed-off-by: Ye (Charlotte) Qi <yeq@meta.com> Co-authored-by: Roger Wang <hey@rogerw.me>	2025-08-03 00:52:38 -07:00

17 Commits