# Using vLLM
First, vLLM must be [installed](../getting_started/installation/) for your chosen device in either a Python or Docker environment.
Then, vLLM supports the following usage patterns:
- [Inference and Serving](../serving/offline_inference.md): Run a single instance of a model.
- [Deployment](../deployment/docker.md): Scale up model instances for production.
- [Training](../training/rlhf.md): Train or fine-tune a model.