vllm/tests/v1/worker at 487dd34e04137d859f8b4e9400d44ecde57e4cee - vllm

Files

Wentao Ye 995dea1354 [Perf] Remove redundant device copies for CPU-only pooling token IDs, 48.9% E2E throughput improvement (#38139 )

Signed-off-by: yewentao256 <zhyanwentao@126.com>

2026-03-29 18:12:50 +00:00

__init__.py

2024-12-26 19:02:58 +09:00

test_gpu_input_batch.py

2026-03-29 18:12:50 +00:00

test_gpu_model_runner_v2_eplb.py

2026-03-25 08:16:39 -07:00

test_gpu_model_runner.py

2026-03-23 20:10:11 -07:00

test_gpu_profiler.py

2026-01-22 09:45:40 -08:00

test_late_interaction_runner.py

2026-03-12 08:37:01 +08:00

test_mamba_utils.py

2026-03-21 09:29:43 +00:00

test_utils.py

2026-03-23 20:10:11 -07:00

test_worker_memory_snapshot.py

2026-03-12 07:57:47 -07:00