Logo
Explore Help
Register Sign In
biondizzle/vllm
1
0
Fork 0
You've already forked vllm
Code Issues Pull Requests Actions 2 Packages Projects Releases Wiki Activity
Files
0be9516ea43df0fcb24bf50021e22768a49d61cf
vllm/vllm/model_executor/layers/pooler
History
Wentao Ye 7b01d97a22 [Perf] Optimize mean pooling using chunks and index_add, 5.9% E2E throughput improvement (#38559)
Signed-off-by: yewentao256 <zhyanwentao@126.com>
2026-04-01 03:54:58 +00:00
..
seqwise
[Perf] Optimize mean pooling using chunks and index_add, 5.9% E2E throughput improvement (#38559)
2026-04-01 03:54:58 +00:00
tokwise
[Model] Deprecate the score task (this will not affect users). (#37537)
2026-03-20 08:07:56 +00:00
__init__.py
[Model] Reorganize pooling layers (#31973)
2026-01-09 11:02:14 +00:00
abstract.py
[Model] Reorganize pooling layers (#31973)
2026-01-09 11:02:14 +00:00
activations.py
[Model] Deprecate the score task (this will not affect users). (#37537)
2026-03-20 08:07:56 +00:00
common.py
[Perf] Remove redundant device copies for CPU-only pooling token IDs, 48.9% E2E throughput improvement (#38139)
2026-03-29 18:12:50 +00:00
special.py
[Perf] Remove redundant device copies for CPU-only pooling token IDs, 48.9% E2E throughput improvement (#38139)
2026-03-29 18:12:50 +00:00
Powered by Gitea Version: 1.25.2 Page: 619ms Template: 2ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API