vllm/tests/v1/e2e at acaa2c0a4a53dbb57f85f1042b1a6f1e3f24cef5 - vllm

Maximilien de Bayser d8bebb008a Add tests for chunked prefill and prefix cache with causal pooling models (#26526)

Signed-off-by: Max de Bayser <mbayser@br.ibm.com>
Co-authored-by: Ayush Singh <ayush1009208@gmail.com>

2025-10-14 07:45:04 +08:00

__init__.py                          2025-01-01 21:56:46 +09:00
test_async_sched_and_preempt.py      2025-10-10 23:27:04 +00:00
test_cascade_attention.py            2025-10-07 15:42:31 +00:00
test_correctness_sliding_window.py   2025-10-07 15:42:31 +00:00
test_kv_sharing_fast_prefill.py      2025-10-07 15:42:31 +00:00
test_min_tokens.py                   2025-10-12 09:51:31 -07:00
test_pooling_chunked_prefill.py      2025-10-14 07:45:04 +08:00
test_spec_decode.py                  2025-10-12 09:51:31 -07:00