vllm/tests/v1/attention at 245e4f2c01d19a567742fee4117badf1f6027da0 - vllm

Files

Huamin Li c312320764 [CI/Build] tests(v1): feed Triton attention the (num_blocks, 2, …) KV cache layout in backend-correctness tests (#26663)

Signed-off-by: Huamin Li <3ericli@gmail.com>
Co-authored-by: Ye (Charlotte) Qi <yeq@meta.com>

2025-10-17 21:11:26 -07:00

test_attention_backends_selection.py  2025-10-05 07:06:22 -07:00
test_attention_backends.py            2025-10-17 21:11:26 -07:00
test_attention_splitting.py           2025-10-07 01:57:49 +00:00
test_chunked_local_attention.py       2025-10-05 07:06:22 -07:00
test_mla_backends.py                  2025-10-14 19:38:20 +00:00
test_sparse_mla_backends.py           2025-10-08 10:09:34 +08:00
utils.py                              2025-10-17 00:48:59 +00:00