This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
9c3fe9936b929b5503d780bd4e8e3cd524de1c4e
vllm
/
tests
/
entrypoints
/
pooling
History
Wentao Ye
99c7892c5b
[Perf] Optimize maxsim scores computation for pooling models, 13.9% E2E throughput improvement (
#35330
)
...
Signed-off-by: yewentao256 <
zhyanwentao@126.com
>
2026-02-26 17:14:54 +00:00
..
basic
[Frontend] Use new Renderer for Completions and Tokenize API (
#32863
)
2026-01-31 04:51:15 -08:00
classify
[Frontend][last/5] Make pooling entrypoints request schema consensus. (
#31127
)
2026-02-09 06:42:38 +00:00
embed
[ROCm][CI] Fix flaky embedding chat test by using tolerance-based comparison (
#35050
)
2026-02-22 09:03:44 +00:00
pooling
[Refactor] Clean up pooling serial utils (
#33665
)
2026-02-03 10:29:18 +00:00
reward
[Model][6/N] Improve all pooling task | Support chunked prefill with ALL pooling (
#27145
)
2025-12-04 13:44:15 +00:00
score
[Perf] Optimize maxsim scores computation for pooling models, 13.9% E2E throughput improvement (
#35330
)
2026-02-26 17:14:54 +00:00
__init__.py
[CI] Split pooling from entrypoints Test (
#24632
)
2025-09-11 01:53:09 -07:00