biondizzle/vllm
991d6bff38ff02f7cf47a3833efce58b27db8bb8
vllm/examples/pooling
Kata Coder 5719a4e4e6 [Frontend] Support multimodal inputs for late-interaction scoring (ColQwen3) + NewModel: nvidia/nemotron-colembed (#34574)
Signed-off-by: craftsangjae <craftsangjae@gmail.com>
2026-02-20 20:01:40 -08:00
Name            Last commit                                                                                                                  Date
classify        [Doc] Update usage of --limit-mm-per-prompt (#34148)                                                                         2026-02-09 21:12:13 -08:00
embed           [CI][BugFix] ShellCheck cleanup to remove baseline and preserve runtime behavior (#34514)                                    2026-02-17 12:22:56 +00:00
plugin          (bugfix): Fixed encode in LLM entrypoint for IOProcessr plugin prompts (#34618)                                              2026-02-16 07:33:55 -08:00
pooling         [Frontend][last/5] Make pooling entrypoints request schema consensus. (#31127)                                               2026-02-09 06:42:38 +00:00
score           [Frontend] Support multimodal inputs for late-interaction scoring (ColQwen3) + NewModel: nvidia/nemotron-colembed (#34574)   2026-02-20 20:01:40 -08:00
token_classify  [Frontend][2/n] Make pooling entrypoints request schema consensus | ChatRequest (#32574)                                     2026-01-22 10:32:44 +00:00
token_embed     [new model] add COLQwen3 code & Inference (#34398)                                                                           2026-02-14 12:15:19 +08:00