vllm/examples/pooling at 9d07a3d6e472c8e5a231a34ec9c38084605b037d - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Harry Mellor c88510083b Fix Qwen2.5-VL test for Transformers v5 (#36532 )

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

2026-03-10 12:05:34 +00:00

..

Fix Qwen2.5-VL test for Transformers v5 (#36532 )

2026-03-10 12:05:34 +00:00

Allow markdownlint to run locally (#36398 )

2026-03-08 20:05:24 -07:00

(bugfix): Fixed encode in LLM entrypoint for IOProcessr plugin prompts (#34618 )

2026-02-16 07:33:55 -08:00

[Frontend][last/5] Make pooling entrypoints request schema consensus. (#31127 )

2026-02-09 06:42:38 +00:00

[Model] Add support for nvidia/llama-nemotron-rerank-vl-1b-v2 (#35735 )

2026-03-03 08:32:14 +08:00

[Frontend][2/n] Make pooling entrypoints request schema consensus | ChatRequest (#32574 )

2026-01-22 10:32:44 +00:00

[new model] add COLQwen3 code & Inference (#34398 )

2026-02-14 12:15:19 +08:00