vllm/vllm/worker at d7afab6d3af84c18ecb9cbc478842e3bf62af906

biondizzle/vllm

Woosuk Kwon 25e86b6a61 Don't use cupy NCCL for AMD backends (#2855)

2024-02-14 12:30:44 -08:00

..

[Speculative decoding 2/9] Multi-step worker for draft model (#2424)

2024-01-21 16:31:47 -08:00

__init__.py

Change the name to vLLM (#150)

2023-06-17 03:07:40 -07:00

cache_engine.py

Remove hardcoded device="cuda" to support more devices (#2503)

2024-02-01 15:46:39 -08:00

model_runner.py

Don't use cupy NCCL for AMD backends (#2855)

2024-02-14 12:30:44 -08:00

worker.py

Don't use cupy NCCL for AMD backends (#2855)

2024-02-14 12:30:44 -08:00