From 2f2c1d73a745d8a38d1a21a5865a7d53d8d616b7 Mon Sep 17 00:00:00 2001
From: Russell Bryant
Date: Wed, 4 Mar 2026 13:01:42 -0500
Subject: [PATCH] [Docs] Upgrade dynamic LoRA warning to admonition block (#35218)

Signed-off-by: Russell Bryant
---
 docs/features/lora.md | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/docs/features/lora.md b/docs/features/lora.md
index ae0124a98..cf868eb14 100644
--- a/docs/features/lora.md
+++ b/docs/features/lora.md
@@ -106,7 +106,8 @@ curl http://localhost:8000/v1/completions \
 
 In addition to serving LoRA adapters at server startup, the vLLM server supports dynamically configuring LoRA adapters at runtime through dedicated API endpoints and plugins. This feature can be particularly useful when the flexibility to change models on-the-fly is needed.
 
-Note: Enabling this feature in production environments is risky as users may participate in model adapter management.
+!!! warning
+    This feature comes with security risks. It should not be used in production unless it is an isolated, fully trusted environment.
 
 To enable dynamic LoRA configuration, ensure that the environment variable `VLLM_ALLOW_RUNTIME_LORA_UPDATING` is set to `True`.
 