[Frontend] deprecate --device arg (#18399)

Signed-off-by: Kebe <mail@kebe7jun.com>
This commit is contained in:
Kebe
2025-05-21 16:21:17 +08:00
committed by GitHub
parent cd8dfc6dfc
commit 5d7f545204
3 changed files with 9 additions and 4 deletions

View File

@@ -66,7 +66,6 @@ XPU platform supports **tensor parallel** inference/serving and also supports **
python -m vllm.entrypoints.openai.api_server \
--model=facebook/opt-13b \
--dtype=bfloat16 \
--device=xpu \
--max_model_len=1024 \
--distributed-executor-backend=ray \
--pipeline-parallel-size=2 \