vllm/tests/v1/e2e at 4e4d017b6f70c729e7c78f74e4328a4ebca7b8ec - vllm

Files

Arjun Reddy 111692bb8c [CI] Add end-to-end V1 min_tokens test coverage (#22495 )

Signed-off-by: Arjun Reddy <189282188+arjunbreddy22@users.noreply.github.com>
Co-authored-by: Arjun Reddy <189282188+arjunbreddy22@users.noreply.github.com>

2025-08-21 22:04:07 -06:00

__init__.py

[V1] Implement Cascade Attention (#11635 )

2025-01-01 21:56:46 +09:00

test_cascade_attention.py

[XPU] Use spawn with XPU multiprocessing (#20649 )

2025-07-09 00:34:28 -07:00

test_correctness_sliding_window.py

[KVCache] Make KVCacheSpec hashable (#21791 )

2025-07-29 19:58:29 +08:00

test_kv_sharing_fast_prefill.py

[CI] Fix tests/v1/e2e/test_kv_sharing_fast_prefill.py import on test (#22815 )

2025-08-13 10:35:50 -07:00

test_min_tokens.py

[CI] Add end-to-end V1 min_tokens test coverage (#22495 )

2025-08-21 22:04:07 -06:00

test_spec_decode.py

[Model] Support deepseek with eagle (#21086 )

2025-08-20 19:01:31 +08:00