biondizzle/vllm
7b5a8b4a9dd6eb26057e3c8e0fa07db0d89f6d54
vllm/vllm/v1/core
junuxyz c5a66d1697 [Core][BugFix] Fix PP KV cache sharding memory validation (#33698)
Signed-off-by: junuxyz <216036880+junuxyz@users.noreply.github.com>
2026-02-10 10:46:24 -05:00
Name                             Last commit                                                                           Date
sched                            [V1][BugFix] Fix EAGLE3 encoder cache miss with disable_chunked_mm_input (#34220)     2026-02-10 13:05:32 +00:00
__init__.py                      [V1] Implement vLLM V1 [1/N] (#9289)                                                  2024-10-22 01:24:07 -07:00
block_pool.py                    [V1][Hybrid] Mamba Prefix Caching with align mode (#30877)                            2026-01-23 09:56:48 -08:00
encoder_cache_manager.py         [Refactor] Move profiling methods to MM budget (#33559)                               2026-02-02 23:27:00 +08:00
kv_cache_coordinator.py          [BugFix] Avoid prefix cache hit in the same schedule step for mamba layers (#29387)   2026-02-10 07:41:16 +00:00
kv_cache_manager.py              [BugFix] Avoid prefix cache hit in the same schedule step for mamba layers (#29387)   2026-02-10 07:41:16 +00:00
kv_cache_metrics.py              [Core][Observability] Add KV cache residency metrics (#27793)                         2025-12-01 18:27:53 +00:00
kv_cache_utils.py                [Core][BugFix] Fix PP KV cache sharding memory validation (#33698)                    2026-02-10 10:46:24 -05:00
single_type_kv_cache_manager.py  [BugFix] Avoid prefix cache hit in the same schedule step for mamba layers (#29387)   2026-02-10 07:41:16 +00:00