This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
2be765b68ace5efe4cba4d03495f543c85784688
vllm
/
vllm
/
model_executor
/
layers
/
attention
History
Matthew Bonanni
20228cb851
[3/N][Attention] Move AttentionMetadata-related code from utils.py to backend.py (
#32054
)
...
Signed-off-by: Matthew Bonanni <
mbonanni@redhat.com
>
2026-01-12 09:13:56 -08:00
..
__init__.py
[1/N][Attention] Restructure attention: move files (
#31916
)
2026-01-09 13:10:24 -08:00
chunked_local_attention.py
[3/N][Attention] Move AttentionMetadata-related code from utils.py to backend.py (
#32054
)
2026-01-12 09:13:56 -08:00
cross_attention.py
[3/N][Attention] Move AttentionMetadata-related code from utils.py to backend.py (
#32054
)
2026-01-12 09:13:56 -08:00
encoder_only_attention.py
[3/N][Attention] Move AttentionMetadata-related code from utils.py to backend.py (
#32054
)
2026-01-12 09:13:56 -08:00
mm_encoder_attention.py
[Misc][LLaMa4] Compile LLaMa Vision Encoder (
#30709
)
2026-01-09 22:01:38 -05:00
static_sink_attention.py
[3/N][Attention] Move AttentionMetadata-related code from utils.py to backend.py (
#32054
)
2026-01-12 09:13:56 -08:00