vllm/tests/v1/e2e at c934caee88f65258aac00d71d9ae0ecc4a4e1cd7 - vllm

Files

Yong Hoon Shin 9324e10275 Fix KV sharing fast prefill with cudagraph enabled (#28537)

Signed-off-by: Yong Hoon Shin <yhshin@meta.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>

2025-11-14 11:53:42 +00:00

__init__.py

2025-01-01 21:56:46 +09:00

test_async_scheduling.py

2025-11-01 00:35:04 +00:00

test_cascade_attention.py

2025-10-07 15:42:31 +00:00

test_context_length.py

2025-11-13 10:18:47 -08:00

test_correctness_sliding_window.py

2025-10-07 15:42:31 +00:00

test_kv_sharing_fast_prefill.py

2025-11-14 11:53:42 +00:00

test_lora_with_spec_decode.py

2025-11-08 01:58:22 +00:00

test_min_tokens.py

2025-10-12 09:51:31 -07:00

test_pooling_chunked_prefill.py

2025-10-14 07:45:04 +08:00

test_spec_decode.py

2025-11-09 16:04:59 +00:00