vllm/tests/kernels/attention at fa72f9a8126051abb9d00144f11aeb7615f36d21 - vllm

Files

Hosang dd5fa7e04f [ROCm][Kernel][V1] Enable AMD Radeon GPU Custom Paged Attention on v1 (#17004 )

Signed-off-by: Hosang Yoon <hosang.yoon@amd.com>

2025-05-21 08:35:00 -07:00

conftest.py

Categorize tests/kernels/ based on kernel type (#16799 )

2025-04-23 09:21:07 -04:00

test_attention_selector.py

fix broken test vllm:test_kernels - test_attention_selector.py::test_flash_attn (#17873 )

2025-05-10 10:46:54 +08:00

test_attention.py

[ROCm][Kernel][V1] Enable AMD Radeon GPU Custom Paged Attention on v1 (#17004 )

2025-05-21 08:35:00 -07:00

test_blocksparse_attention.py

Categorize tests/kernels/ based on kernel type (#16799 )

2025-04-23 09:21:07 -04:00

test_cache.py

Allocate kv_cache with stride order (#16605 )

2025-04-25 22:03:31 -07:00

test_cascade_flash_attn.py

Categorize tests/kernels/ based on kernel type (#16799 )

2025-04-23 09:21:07 -04:00

test_encoder_decoder_attn.py

Categorize tests/kernels/ based on kernel type (#16799 )

2025-04-23 09:21:07 -04:00

test_flash_attn.py

Update test_flash_attn.py (#17102 )

2025-04-26 22:17:35 +00:00

test_flashinfer.py

Categorize tests/kernels/ based on kernel type (#16799 )

2025-04-23 09:21:07 -04:00

test_flashmla.py

[Bugfix] Fix triton import with local TritonPlaceholder (#17446 )

2025-05-06 17:53:09 +08:00

test_lightning_attn.py

Categorize tests/kernels/ based on kernel type (#16799 )

2025-04-23 09:21:07 -04:00

test_merge_attn_states.py

Categorize tests/kernels/ based on kernel type (#16799 )

2025-04-23 09:21:07 -04:00

test_mha_attn.py

Categorize tests/kernels/ based on kernel type (#16799 )

2025-04-23 09:21:07 -04:00

test_mla_decode_cpu.py

Categorize tests/kernels/ based on kernel type (#16799 )

2025-04-23 09:21:07 -04:00

test_prefix_prefill.py

Categorize tests/kernels/ based on kernel type (#16799 )

2025-04-23 09:21:07 -04:00

test_rocm_attention_selector.py

[FEAT][ROCm]: Support AITER MLA on V1 Engine (#17523 )

2025-05-09 10:42:05 +08:00

test_triton_decode_attention.py

Categorize tests/kernels/ based on kernel type (#16799 )

2025-04-23 09:21:07 -04:00

test_triton_unified_attention.py

[Bugfix] Fix fp8 tests for triton_unified_attention for Triton 3.3 (#18013 )

2025-05-15 13:26:34 +08:00