Logo
Explore Help
Register Sign In
biondizzle/vllm
1
0
Fork 0
You've already forked vllm
Code Issues Pull Requests Actions 2 Packages Projects Releases Wiki Activity
Files
200da9a51751973740f4dc71f0d1e13cc5698cb0
vllm/vllm/v1/core
History
Chen Zhang 200da9a517 [v1] Move block management logic from KVCacheManager to SpecializedManager (#17474)
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
2025-05-09 15:25:34 +00:00
..
sched
[Core][Feature] Input metadata dump on crash (#13407)
2025-05-07 22:15:09 +00:00
__init__.py
[V1] Implement vLLM V1 [1/N] (#9289)
2024-10-22 01:24:07 -07:00
block_pool.py
[V1][Metrics] add support for kv event publishing (#16750)
2025-04-30 07:44:45 -07:00
encoder_cache_manager.py
Enforce valid max_num_batched_tokens when disable_chunked_mm_input=True (#16447)
2025-04-11 08:09:52 +00:00
kv_cache_manager.py
[v1] Move block management logic from KVCacheManager to SpecializedManager (#17474)
2025-05-09 15:25:34 +00:00
kv_cache_utils.py
[Core] Prevent side-channel attacks via cache salting (#17045)
2025-04-30 20:27:21 +08:00
specialized_manager.py
[v1] Move block management logic from KVCacheManager to SpecializedManager (#17474)
2025-05-09 15:25:34 +00:00
Powered by Gitea Version: 1.25.2 Page: 84ms Template: 2ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API