biondizzle/vllm: tests/v1/attention at commit df78aeef084cf35eecc6ba52640de8c390c99543
Latest commit: 066209a045 by Nicolò Lucchesi, 2025-11-22 06:38:44 -08:00
[Attention] Refactor FA block_size limitations to hybrid models only (#29084)
Signed-off-by: NickLucche <nlucches@redhat.com>
File | Last commit | Date
test_attention_backends_selection.py | Convert formatting to use ruff instead of yapf + isort (#26247) | 2025-10-05 07:06:22 -07:00
test_attention_backends.py | [Attention] Refactor CUDA attention backend selection logic (#24794) | 2025-11-11 07:40:44 -05:00
test_attention_splitting.py | [Core] Simplify the Dp padding/should ubatch coordination logic (#25768) | 2025-10-07 01:57:49 +00:00
test_batch_reordering.py | [BugFix] Reordering extend logic fix (#27739) | 2025-10-29 21:39:34 -07:00
test_chunked_local_attention.py | Convert formatting to use ruff instead of yapf + isort (#26247) | 2025-10-05 07:06:22 -07:00
test_mla_backends.py | [Attention] Refactor FA block_size limitations to hybrid models only (#29084) | 2025-11-22 06:38:44 -08:00
test_sparse_mla_backends.py | Add TP parameter to attention tests (#27683) | 2025-11-03 13:04:40 -08:00
utils.py | [Attention] Refactor CUDA attention backend selection logic (#24794) | 2025-11-11 07:40:44 -05:00
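
These files are standard pytest suites, so any one of them (or the whole directory) can be run directly. A minimal sketch, assuming a vLLM source checkout with pytest and the test dependencies installed; the specific file chosen here is illustrative, and this is not vLLM's documented test entry point:

    # Minimal sketch: run one of the listed attention test files with pytest.
    # Assumes the working directory is the repository root; any file from the
    # table above can be substituted for the path below.
    import pytest

    raise SystemExit(pytest.main(["tests/v1/attention/test_mla_backends.py", "-v"]))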