vllm/tests/v1/attention at b6e04390d3ea5ebc79ac70d1b76d638c56fa8ce2 - vllm

Matthew Bonanni b30dfa03c5 [Attention] Refactor CUDA attention backend selection logic (#24794)

Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Signed-off-by: Matthew Bonanni <mbonanni001@gmail.com>
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>

2025-11-11 07:40:44 -05:00

test_attention_backends_selection.py

…

test_attention_backends.py

[Attention] Refactor CUDA attention backend selection logic (#24794)

2025-11-11 07:40:44 -05:00

test_attention_splitting.py

…

test_batch_reordering.py

…

test_chunked_local_attention.py

…

test_mla_backends.py

[Attention] Refactor CUDA attention backend selection logic (#24794)

2025-11-11 07:40:44 -05:00

test_sparse_mla_backends.py

Add TP parameter to attention tests (#27683)

2025-11-03 13:04:40 -08:00

utils.py

[Attention] Refactor CUDA attention backend selection logic (#24794)

2025-11-11 07:40:44 -05:00