vllm/examples/pooling at b6d5a17298548e77cf5af456e029e5beb26b253c - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Jakub Zakrzewski 111d869069 [Model] Add nvidia/llama-nemotron-embed-vl-1b-v2 multimodal embedding model (#35297 )

Signed-off-by: Jakub Zakrzewski <jzakrzewski@nvidia.com>

2026-02-26 14:17:17 +00:00

..

[Doc] Update usage of --limit-mm-per-prompt (#34148 )

2026-02-09 21:12:13 -08:00

[Model] Add nvidia/llama-nemotron-embed-vl-1b-v2 multimodal embedding model (#35297 )

2026-02-26 14:17:17 +00:00

(bugfix): Fixed encode in LLM entrypoint for IOProcessr plugin prompts (#34618 )

2026-02-16 07:33:55 -08:00

[Frontend][last/5] Make pooling entrypoints request schema consensus. (#31127 )

2026-02-09 06:42:38 +00:00

[New Model] Add ColModernVBERT (#34558 )

2026-02-22 12:23:41 +08:00

[Frontend][2/n] Make pooling entrypoints request schema consensus | ChatRequest (#32574 )

2026-01-22 10:32:44 +00:00

[new model] add COLQwen3 code & Inference (#34398 )

2026-02-14 12:15:19 +08:00