vllm/tests/async_engine at 35c4bc20d9d454f58506b561b6770d3ae4752bf9 - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Zhuohan Li fd4ea8ef5c Use NCCL instead of ray for control-plane communication to remove serialization overhead (#2221 )

2024-01-03 11:30:22 -08:00

..

api_server_async_engine.py

Migrate linter from pylint to ruff (#1665 )

2023-11-20 11:58:01 -08:00

test_api_server.py

Use NCCL instead of ray for control-plane communication to remove serialization overhead (#2221 )

2024-01-03 11:30:22 -08:00

test_async_llm_engine.py

TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181 )

2023-10-02 15:36:09 -07:00

test_openai_server.py

Support chat template and echo for chat API (#1756 )

2023-11-30 16:43:13 -08:00

test_request_tracker.py

Implement prompt logprobs & Batched topk for computing logprobs (#1328 )

2023-10-16 10:56:50 -07:00