From 334c715e0f4f4de2d3de90bd0b9bba59df143eda Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Nicol=C3=B2=20Lucchesi?= Date: Thu, 12 Feb 2026 18:01:51 +0100 Subject: [PATCH] [Docs] Spec decoding docs warning removal (#34439) Signed-off-by: NickLucche --- docs/features/spec_decode/README.md | 5 ----- 1 file changed, 5 deletions(-) diff --git a/docs/features/spec_decode/README.md b/docs/features/spec_decode/README.md index 0d19ef839..0cc77ad4b 100644 --- a/docs/features/spec_decode/README.md +++ b/docs/features/spec_decode/README.md @@ -1,10 +1,5 @@ # Speculative Decoding -!!! warning - Please note that speculative decoding in vLLM is not yet optimized and does - not usually yield inter-token latency reductions for all prompt datasets or sampling parameters. - The work to optimize it is ongoing and can be followed here: - !!! warning Currently, speculative decoding in vLLM is not compatible with pipeline parallelism.