biondizzle/vllm
vllm/tests/v1/attention at commit e5db3e2774fd16394f8a96a608263ff2416385c8
Latest commit 3e41992fec: [Attention] Use sparse prefill kernel for fp8 kv-cache in DeepSeek-v3.2 (#27532), Lucas Wilkinson, Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>, 2025-12-12 05:57:47 -08:00
File | Last commit | Date
test_attention_backends_selection.py | Convert formatting to use ruff instead of yapf + isort (#26247) | 2025-10-05 07:06:22 -07:00
test_attention_backends.py | [v1] Add real sliding window calculation to FlexAttention direct BlockMask building (#26015) | 2025-12-01 13:12:51 +00:00
test_attention_splitting.py | [Attention] Make split_decodes_and_prefills(..., require_uniform=True) support padding (#29644) | 2025-12-09 07:24:01 +00:00
test_batch_reordering.py | [BugFix] Reordering extend logic fix (#27739) | 2025-10-29 21:39:34 -07:00
test_chunked_local_attention.py | Convert formatting to use ruff instead of yapf + isort (#26247) | 2025-10-05 07:06:22 -07:00
test_mla_backends.py | [Attention] Refactor FA block_size limitations to hybrid models only (#29084) | 2025-11-22 06:38:44 -08:00
test_rocm_attention_backends_selection.py | [Attention] Update attention imports (#29540) | 2025-11-27 11:19:09 -05:00
test_sparse_mla_backends.py | [Attention] Use sparse prefill kernel for fp8 kv-cache in DeepSeek-v3.2 (#27532) | 2025-12-12 05:57:47 -08:00
utils.py | [Attention] Make seq_lens_cpu optional in CommonAttentionMetadata to enable true async spec-decode (#29624) | 2025-12-09 17:18:10 -08:00