vllm/tests/v1/e2e at 40b69e33e796efdc75e774a1c38cc73397ea6e17 - vllm

Files

Rémi Delacourt cec7c28833 [Bugfix] Padded Eagle Specdec with Chunked Prefill (#26263 )

Signed-off-by: Rémi Delacourt <remi@mistral.ai>
Signed-off-by: Rémi Delacourt <54138269+Flechman@users.noreply.github.com>
Signed-off-by: remi <remi@mistral.ai>
Co-authored-by: Benjamin Chislett <bchislett@nvidia.com>

2025-11-03 02:22:46 -05:00

__init__.py

[V1] Implement Cascade Attention (#11635 )

2025-01-01 21:56:46 +09:00

test_async_scheduling.py

[Core] Async scheduling + structured outputs compatibility (#26866 )

2025-11-01 00:35:04 +00:00

test_cascade_attention.py

[V0 Deprecation] Remove VLLM_USE_V1 from tests (#26341 )

2025-10-07 15:42:31 +00:00

test_correctness_sliding_window.py

[V0 Deprecation] Remove VLLM_USE_V1 from tests (#26341 )

2025-10-07 15:42:31 +00:00

test_kv_sharing_fast_prefill.py

[Frontend][torch.compile] CompilationConfig Overhaul (#20283 ): name change compilation level to compilation mode, deprecation compilation level (#26355 )

2025-10-15 02:51:16 +00:00

test_min_tokens.py

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633 )

2025-10-12 09:51:31 -07:00

test_pooling_chunked_prefill.py

Add tests for chunked prefill and prefix cache with causal pooling models (#26526 )

2025-10-14 07:45:04 +08:00

test_spec_decode.py

[Bugfix] Padded Eagle Specdec with Chunked Prefill (#26263 )

2025-11-03 02:22:46 -05:00