vllm/tests/v1/worker at 5a71cdd76ebc4f55a7490e087d2a50bd892ab3bc - vllm

Files

Wentao Ye c34ba6b961 [Perf] Optimize compute maxsim using batched version, 3.2% E2E throughput improvement (#36710)

Signed-off-by: yewentao256 <zhyanwentao@126.com>

2026-03-12 08:37:01 +08:00

__init__.py                       2024-12-26 19:02:58 +09:00
test_gpu_input_batch.py           2025-10-23 19:08:06 +00:00
test_gpu_model_runner.py          2026-03-11 01:11:23 -07:00
test_gpu_profiler.py              2026-01-22 09:45:40 -08:00
test_late_interaction_runner.py   2026-03-12 08:37:01 +08:00
test_mamba_utils.py               2026-02-28 01:50:37 +08:00
test_utils.py                     2026-01-27 10:02:51 -05:00
test_worker_memory_snapshot.py    2026-02-28 04:46:42 +00:00