[Docs] Remove outdated async_scheduling limitation with speculative decoding (#32775)

Signed-off-by: Ifta Khairul Alam Adil <ikaadil007@gmail.com>
Signed-off-by: Ifta khairul Alam Adil <25082512+ikaadil@users.noreply.github.com>
This commit is contained in:
Ifta khairul Alam Adil
2026-01-22 05:19:25 +01:00
committed by GitHub
parent 24a163ed77
commit a1d82466ea

View File

@@ -133,9 +133,6 @@ class SchedulerConfig:
async_scheduling: bool = Field(default=None)
"""If set to False, disable async scheduling. Async scheduling helps to
avoid gaps in GPU utilization, leading to better latency and throughput.
It is currently not supported with some features such as
speculative decoding and pipeline parallelism, and will be automatically
disabled in those cases.
"""
stream_interval: int = Field(default=1, ge=1)