biondizzle/vllm
458c1a4b2d21965ecd41b76ec0506ffe5ed8c8a1
vllm/tests/entrypoints/pooling
Wentao Ye  c34ba6b961  [Perf] Optimize compute maxsim using batched version, 3.2% E2E throughput improvement (#36710)
Signed-off-by: yewentao256 <zhyanwentao@126.com>
2026-03-12 08:37:01 +08:00
..
basic        …
classify     Fix Qwen2.5-VL test for Transformers v5 (#36532)    2026-03-10 12:05:34 +00:00
embed        feat: expose media_io_kwargs at runtime (#34778)    2026-03-07 04:27:04 +00:00
pooling      …
reward       …
score        [Perf] Optimize compute maxsim using batched version, 3.2% E2E throughput improvement (#36710)    2026-03-12 08:37:01 +08:00
__init__.py  …