vllm/tests/v1/core at 8a57872b2ac9b01004ae1d3a3a689de218ea5be5 - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Nick Hill 2dbe8c0774 [Perf] API-server scaleout with many-to-many server-engine comms (#17546 )

2025-05-30 08:17:00 -07:00

..

test_kv_cache_utils.py

[Perf] API-server scaleout with many-to-many server-engine comms (#17546 )

2025-05-30 08:17:00 -07:00

test_prefix_caching.py

[Perf] API-server scaleout with many-to-many server-engine comms (#17546 )

2025-05-30 08:17:00 -07:00

test_scheduler_e2e.py

[Feature][V1]: suupports cached_tokens in response usage (#18149 )

2025-05-23 01:41:03 -07:00

test_scheduler.py

[Perf] API-server scaleout with many-to-many server-engine comms (#17546 )

2025-05-30 08:17:00 -07:00

test_specialized_manager.py

[v1][KVCacheManager] Avoid full cache hit by controlling max_length (#17999 )

2025-05-13 06:50:38 +00:00