vllm/vllm/engine at 2e0b6e775756345aa1d39f772c186e00f8c29e92 - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Zhuohan Li fd4ea8ef5c Use NCCL instead of ray for control-plane communication to remove serialization overhead (#2221 )

2024-01-03 11:30:22 -08:00

..

__init__.py

Change the name to vLLM (#150 )

2023-06-17 03:07:40 -07:00

arg_utils.py

Update Help Text for --gpu-memory-utilization Argument (#2183 )

2023-12-18 11:33:24 -08:00

async_llm_engine.py

Use NCCL instead of ray for control-plane communication to remove serialization overhead (#2221 )

2024-01-03 11:30:22 -08:00

llm_engine.py

Use NCCL instead of ray for control-plane communication to remove serialization overhead (#2221 )

2024-01-03 11:30:22 -08:00

metrics.py

Add Production Metrics in Prometheus format (#1890 )

2023-12-02 16:37:44 -08:00

ray_utils.py

Use NCCL instead of ray for control-plane communication to remove serialization overhead (#2221 )

2024-01-03 11:30:22 -08:00