This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
c57523239587e2df78a7264e595a2ec3692c80e9
vllm
/
vllm
/
v1
/
core
History
Roger Wang
af51d80fa1
Revert "[V1] Scatter and gather placeholders in the model runner" (
#16075
)
2025-04-04 14:50:57 -07:00
..
sched
Revert "[V1] Scatter and gather placeholders in the model runner" (
#16075
)
2025-04-04 14:50:57 -07:00
__init__.py
[V1] Implement vLLM V1 [1/N] (
#9289
)
2024-10-22 01:24:07 -07:00
block_pool.py
[V1] Implement sliding window attention in kv_cache_manager (
#14097
)
2025-04-01 00:33:17 -07:00
encoder_cache_manager.py
[Misc] Avoid direct access of global
mm_registry
in
compute_encoder_budget
(
#15621
)
2025-03-27 17:52:00 +00:00
kv_cache_manager.py
[V1] Implement sliding window attention in kv_cache_manager (
#14097
)
2025-04-01 00:33:17 -07:00
kv_cache_utils.py
Revert "[V1] Scatter and gather placeholders in the model runner" (
#16075
)
2025-04-04 14:50:57 -07:00
specialized_manager.py
[V1] Implement sliding window attention in kv_cache_manager (
#14097
)
2025-04-01 00:33:17 -07:00