vllm/tests/models/language at 458e74eb907f96069e6d8a4f3c9f457001fef2ea - vllm

Files

wang.yuqi 84cf78acee [Model] Pooling models default to using chunked prefill & prefix caching if supported. (#20930 )

Signed-off-by: wang.yuqi <noooop@126.com>

2025-08-11 09:41:37 -07:00

2025-08-09 20:16:11 -07:00

2025-08-11 09:41:37 -07:00

__init__.py

2025-04-30 23:03:08 -07:00