vllm/examples/pooling at f74f1572ca3a0973d8db2187f0064bfecb6d5df2

biondizzle/vllm

Kata Coder 5719a4e4e6 [Frontend] Support multimodal inputs for late-interaction scoring (ColQwen3) + NewModel: nvidia/nemotron-colembed (#34574)

Signed-off-by: craftsangjae <craftsangjae@gmail.com>

2026-02-20 20:01:40 -08:00

[Doc] Update usage of --limit-mm-per-prompt (#34148)

2026-02-09 21:12:13 -08:00

[CI][BugFix] ShellCheck cleanup to remove baseline and preserve runtime behavior (#34514)

2026-02-17 12:22:56 +00:00

(bugfix): Fixed encode in LLM entrypoint for IOProcessr plugin prompts (#34618)

2026-02-16 07:33:55 -08:00

[Frontend][last/5] Make pooling entrypoints request schema consensus. (#31127)

2026-02-09 06:42:38 +00:00

[Frontend] Support multimodal inputs for late-interaction scoring (ColQwen3) + NewModel: nvidia/nemotron-colembed (#34574)

2026-02-20 20:01:40 -08:00

[Frontend][2/n] Make pooling entrypoints request schema consensus | ChatRequest (#32574)

2026-01-22 10:32:44 +00:00

[new model] add COLQwen3 code & Inference (#34398)

2026-02-14 12:15:19 +08:00