vllm/tests/v1/worker at fe714dd5071d1e1f829ecfe4ee10d0d7e6144b5f - vllm

Files

Wentao Ye 7279374f91 [Perf] Compute maxsim in worker side, reducing redundant copies, 2.7% E2E throughput improvement (#36159 )

Signed-off-by: yewentao256 <zhyanwentao@126.com>

2026-03-09 20:55:58 -07:00

__init__.py

2024-12-26 19:02:58 +09:00

test_gpu_input_batch.py

2025-10-23 19:08:06 +00:00

test_gpu_model_runner.py

2026-03-07 22:09:55 +08:00

test_gpu_profiler.py

2026-01-22 09:45:40 -08:00

test_late_interaction_runner.py

2026-03-09 20:55:58 -07:00

test_mamba_utils.py

2026-02-28 01:50:37 +08:00

test_utils.py

2026-01-27 10:02:51 -05:00

test_worker_memory_snapshot.py

2026-02-28 04:46:42 +00:00