biondizzle/vllm
bb39382b2b28b0571054fee4a266b96d7e33ab58
vllm/tests/v1/worker
wang.yuqi a9b4f07ba2 [Frontend] Re-enable running MaxSim on GPU (#38620)
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
2026-04-03 00:03:13 +08:00
__init__.py
…
test_gpu_input_batch.py
[Perf] Remove redundant device copies for CPU-only pooling token IDs, 48.9% E2E throughput improvement (#38139)
2026-03-29 18:12:50 +00:00
test_gpu_model_runner_v2_eplb.py
[Feature] EPLB Support for GPU Model Runner v2 (#37488)
2026-03-25 08:16:39 -07:00
test_gpu_model_runner.py
[HMA]Fix corner case when hybrid page_size can not be evenly divided issue (blk_size=64,tp=4) (#37467)
2026-03-30 16:47:30 +00:00
test_gpu_profiler.py
Support custom URI schemes and trace handlers for profiler (#32393)
2026-01-22 09:45:40 -08:00
test_late_interaction_runner.py
[Frontend] Re-enable running MaxSim on GPU (#38620)
2026-04-03 00:03:13 +08:00
test_mamba_utils.py
[Hybrid] calling get_mamba_groups() once at MambaCopyBuffers.create() (#37318)
2026-03-21 09:29:43 +00:00
test_utils.py
[V0 Deprecation] Refactor kv cache from list to element (#37487)
2026-03-23 20:10:11 -07:00
test_worker_memory_snapshot.py
[Hardware] Replace torch.cuda.device_count/current_device/set_device API (#36145)
2026-03-12 07:57:47 -07:00