diff --git a/docs/serving/openai_compatible_server.md b/docs/serving/openai_compatible_server.md index 3020cb7b4..9d2b1e1ff 100644 --- a/docs/serving/openai_compatible_server.md +++ b/docs/serving/openai_compatible_server.md @@ -173,6 +173,14 @@ with `--enable-request-id-headers`. print(completion._request_id) ``` +## Offline API Documentation + +The FastAPI `/docs` endpoint requires an internet connection by default. To enable offline access in air-gapped environments, use the `--enable-offline-docs` flag: + +```bash +vllm serve NousResearch/Meta-Llama-3-8B-Instruct --enable-offline-docs +``` + ## API Reference ### Completions API