vllm/tests/v1/engine at 6d18ed2a2e858a8061dfe8c2e140c2c498d6a99a - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Nick Hill 2dbe8c0774 [Perf] API-server scaleout with many-to-many server-engine comms (#17546 )

2025-05-30 08:17:00 -07:00

..

__init__.py

[V1] AsyncLLM Implementation (#9826 )

2024-11-11 23:05:38 +00:00

conftest.py

[V1][Metrics] add support for kv event publishing (#16750 )

2025-04-30 07:44:45 -07:00

test_async_llm.py

[V1][Metrics] Allow V1 AsyncLLM to use custom logger (#14661 )

2025-04-25 22:05:40 -07:00

test_engine_args.py

[V1] Revert the default max_num_seqs to V0 values for most hardware (#16158 )

2025-04-07 13:54:36 -04:00

test_engine_core_client.py

[CI] don't skip fixed test_kv_cache_events() (#18183 )

2025-05-14 23:17:16 -07:00

test_engine_core.py

[Perf] API-server scaleout with many-to-many server-engine comms (#17546 )

2025-05-30 08:17:00 -07:00

test_llm_engine.py

[V1][Metrics] Add API for accessing in-memory Prometheus metrics (#17010 )

2025-05-27 09:37:06 +00:00

test_output_processor.py

[Core] Prevent side-channel attacks via cache salting (#17045 )

2025-04-30 20:27:21 +08:00

utils.py

Simplify TokenizerGroup (#16790 )

2025-04-24 04:43:56 -07:00