vllm/vllm/v1/core at c57523239587e2df78a7264e595a2ec3692c80e9 - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Roger Wang af51d80fa1 Revert "[V1] Scatter and gather placeholders in the model runner" (#16075 )

2025-04-04 14:50:57 -07:00

..

Revert "[V1] Scatter and gather placeholders in the model runner" (#16075 )

2025-04-04 14:50:57 -07:00

__init__.py

[V1] Implement vLLM V1 [1/N] (#9289 )

2024-10-22 01:24:07 -07:00

block_pool.py

[V1] Implement sliding window attention in kv_cache_manager (#14097 )

2025-04-01 00:33:17 -07:00

encoder_cache_manager.py

[Misc] Avoid direct access of global mm_registry in compute_encoder_budget (#15621 )

2025-03-27 17:52:00 +00:00

kv_cache_manager.py

[V1] Implement sliding window attention in kv_cache_manager (#14097 )

2025-04-01 00:33:17 -07:00

kv_cache_utils.py

Revert "[V1] Scatter and gather placeholders in the model runner" (#16075 )

2025-04-04 14:50:57 -07:00

specialized_manager.py

[V1] Implement sliding window attention in kv_cache_manager (#14097 )

2025-04-01 00:33:17 -07:00