docs/deployment/frameworks/anyscale.md

# Anyscale

[](){ #deployment-anyscale }

[Anyscale](https://www.anyscale.com) is a managed, multi-cloud platform developed by the creators of Ray.

Anyscale automates the entire lifecycle of Ray clusters in your AWS, GCP, or Azure account, delivering the flexibility of open-source Ray
without the operational overhead of maintaining Kubernetes control planes, configuring autoscalers, managing observability stacks, or manually managing head and worker nodes with helper scripts like <gh-file:examples/online_serving/run_cluster.sh>.

When serving large language models with vLLM, Anyscale can rapidly provision [production-ready HTTPS endpoints](https://docs.anyscale.com/examples/deploy-ray-serve-llms) or [fault-tolerant batch inference jobs](https://docs.anyscale.com/examples/ray-data-llm).

## Production-ready vLLM on Anyscale quickstarts

- [Offline batch inference](https://console.anyscale.com/template-preview/llm_batch_inference?utm_source=vllm_docs)
- [Deploy vLLM services](https://console.anyscale.com/template-preview/llm_serving?utm_source=vllm_docs)
- [Curate a dataset](https://console.anyscale.com/template-preview/audio-dataset-curation-llm-judge?utm_source=vllm_docs)
- [Finetune an LLM](https://console.anyscale.com/template-preview/entity-recognition-with-llms?utm_source=vllm_docs)
Stop using title frontmatter and fix doc that can only be reached by search (#20623) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> 2025-07-08 11:27:40 +01:00			`# Anyscale`

[Docs] Add Anyscale to frameworks (#20590) Signed-off-by: Ricardo Decal <rdecal@anyscale.com> 2025-07-07 20:09:13 -07:00			`[](){ #deployment-anyscale }`

			`[Anyscale](https://www.anyscale.com) is a managed, multi-cloud platform developed by the creators of Ray.`
[Docs] Enhance Anyscale documentation, add quickstart links for vLLM (#21018) Signed-off-by: Ricardo Decal <rdecal@anyscale.com> 2025-07-15 22:46:56 -04:00
			`Anyscale automates the entire lifecycle of Ray clusters in your AWS, GCP, or Azure account, delivering the flexibility of open-source Ray`
			`without the operational overhead of maintaining Kubernetes control planes, configuring autoscalers, managing observability stacks, or manually managing head and worker nodes with helper scripts like <gh-file:examples/online_serving/run_cluster.sh>.`

[Docs] Add Anyscale to frameworks (#20590) Signed-off-by: Ricardo Decal <rdecal@anyscale.com> 2025-07-07 20:09:13 -07:00			`When serving large language models with vLLM, Anyscale can rapidly provision [production-ready HTTPS endpoints](https://docs.anyscale.com/examples/deploy-ray-serve-llms) or [fault-tolerant batch inference jobs](https://docs.anyscale.com/examples/ray-data-llm).`
[Docs] Enhance Anyscale documentation, add quickstart links for vLLM (#21018) Signed-off-by: Ricardo Decal <rdecal@anyscale.com> 2025-07-15 22:46:56 -04:00
			`## Production-ready vLLM on Anyscale quickstarts`

			`- [Offline batch inference](https://console.anyscale.com/template-preview/llm_batch_inference?utm_source=vllm_docs)`
			`- [Deploy vLLM services](https://console.anyscale.com/template-preview/llm_serving?utm_source=vllm_docs)`
			`- [Curate a dataset](https://console.anyscale.com/template-preview/audio-dataset-curation-llm-judge?utm_source=vllm_docs)`
			`- [Finetune an LLM](https://console.anyscale.com/template-preview/entity-recognition-with-llms?utm_source=vllm_docs)`