This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
84e439a9cbbd68dd263ff49e73bc962f5e5ffbdd
vllm
/
vllm
/
v1
/
attention
History
Or Ozeri
7cc302dd87
[kv_offload+HMA][7/N]: Support register_kv_caches for hybrid models (
#37853
)
...
Signed-off-by: Or Ozeri <
oro@il.ibm.com
>
2026-03-27 08:38:33 +03:00
..
backends
[kv_offload+HMA][7/N]: Support register_kv_caches for hybrid models (
#37853
)
2026-03-27 08:38:33 +03:00
ops
[Bugfix][ROCm] Fix lru_cache on paged_mqa_logits_module (
#37547
)
2026-03-26 19:01:05 +00:00
__init__.py
[V1] Implement vLLM V1 [1/N] (
#9289
)
2024-10-22 01:24:07 -07:00
backend.py
[Attention] Support distinguishing between short extends and decodes (
#37303
)
2026-03-20 10:49:36 -07:00
selector.py
[Bugfix][Minor] Fix potential NameError in mamba backend selector and misc typos (
#35886
)
2026-03-26 11:59:24 -04:00