biondizzle/vllm
vllm/examples/pooling at commit 9d07a3d6e472c8e5a231a34ec9c38084605b037d
Harry Mellor  c88510083b  Fix Qwen2.5-VL test for Transformers v5 (#36532)  2026-03-10 12:05:34 +00:00
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Name            Last commit  Date
classify        Fix Qwen2.5-VL test for Transformers v5 (#36532)  2026-03-10 12:05:34 +00:00
embed           Allow markdownlint to run locally (#36398)  2026-03-08 20:05:24 -07:00
plugin          (bugfix): Fixed encode in LLM entrypoint for IOProcessr plugin prompts (#34618)  2026-02-16 07:33:55 -08:00
pooling         [Frontend][last/5] Make pooling entrypoints request schema consensus. (#31127)  2026-02-09 06:42:38 +00:00
score           [Model] Add support for nvidia/llama-nemotron-rerank-vl-1b-v2 (#35735)  2026-03-03 08:32:14 +08:00
token_classify  [Frontend][2/n] Make pooling entrypoints request schema consensus | ChatRequest (#32574)  2026-01-22 10:32:44 +00:00
token_embed     [new model] add COLQwen3 code & Inference (#34398)  2026-02-14 12:15:19 +08:00