vllm/vllm/model_executor/models/transformers at 9d0d7f48d55ae2e1933564491cfa1f97682fde1a

biondizzle/vllm

Cyrus Leung 9101dc756c [Model] Avoid hardcoding pooling type (#32119)

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

2026-01-11 21:28:12 -08:00

__init__.py

[Docs] Update the name of Transformers backend -> Transformers modeling backend (#28725)

2025-11-14 16:34:14 +00:00

base.py

[1/N][Attention] Restructure attention: move files (#31916)

2026-01-09 13:10:24 -08:00

causal.py

[Docs] Update the name of Transformers backend -> Transformers modeling backend (#28725)

2025-11-14 16:34:14 +00:00

legacy.py

[Docs] Update the name of Transformers backend -> Transformers modeling backend (#28725)

2025-11-14 16:34:14 +00:00

moe.py

[Doc] Add developer guide for CustomOp (#30886)

2026-01-09 16:21:11 +00:00

multimodal.py

[ROCm][CI][Bugfix] Multi-Modal Model Support Fixes and Attention Backend Improvements (#30270)

2025-12-19 02:17:27 +00:00

pooling.py

[Model] Avoid hardcoding pooling type (#32119)

2026-01-11 21:28:12 -08:00

utils.py

Replace nn.ConvNd with vLLM's ConvNdLayer for Transformers modeling backend (#31498)

2025-12-29 16:20:01 +00:00