This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
bb3eb80d92f66eed4016fa5ed9d9e66125a14482
vllm
/
vllm
/
attention
History
baonudesifeizhai
6cbd41909e
Feature/vit attention unification# 23880 (
#23978
)
...
Signed-off-by: Isotr0py <
mozf@mail2.sysu.edu.cn
> Co-authored-by: Isotr0py <
mozf@mail2.sysu.edu.cn
>
2025-09-10 06:10:14 -07:00
..
backends
[Feature] Disallow FlashMLA on Blackwell (
#24521
)
2025-09-09 14:59:34 -04:00
layers
[Misc] Modify CacheConfig import (
#23459
)
2025-08-23 06:05:27 +00:00
ops
[V0 deprecation] Deprecate V0 Neuron backend (
#21159
)
2025-09-06 16:15:18 -07:00
utils
[Attention] FlashAttn MLA (
#14258
)
2025-09-04 02:47:59 -07:00
__init__.py
Remove duplicate entry in vllm.attention.__all__ (
#23296
)
2025-08-20 17:14:59 -07:00
layer.py
Feature/vit attention unification# 23880 (
#23978
)
2025-09-10 06:10:14 -07:00
selector.py
[gpt-oss] Enable gpt-oss on ampere (
#22714
)
2025-08-12 03:21:44 -07:00