This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
0be9516ea43df0fcb24bf50021e22768a49d61cf
vllm
/
vllm
/
model_executor
/
layers
/
pooler
History
Wentao Ye
7b01d97a22
[Perf] Optimize mean pooling using chunks and index_add, 5.9% E2E throughput improvement (
#38559
)
...
Signed-off-by: yewentao256 <
zhyanwentao@126.com
>
2026-04-01 03:54:58 +00:00
..
seqwise
[Perf] Optimize mean pooling using chunks and index_add, 5.9% E2E throughput improvement (
#38559
)
2026-04-01 03:54:58 +00:00
tokwise
[Model] Deprecate the score task (this will not affect users). (
#37537
)
2026-03-20 08:07:56 +00:00
__init__.py
[Model] Reorganize pooling layers (
#31973
)
2026-01-09 11:02:14 +00:00
abstract.py
[Model] Reorganize pooling layers (
#31973
)
2026-01-09 11:02:14 +00:00
activations.py
[Model] Deprecate the score task (this will not affect users). (
#37537
)
2026-03-20 08:07:56 +00:00
common.py
[Perf] Remove redundant device copies for CPU-only pooling token IDs, 48.9% E2E throughput improvement (
#38139
)
2026-03-29 18:12:50 +00:00
special.py
[Perf] Remove redundant device copies for CPU-only pooling token IDs, 48.9% E2E throughput improvement (
#38139
)
2026-03-29 18:12:50 +00:00