[Docs] Remove outdated async_scheduling limitation with speculative decoding (#32775)
Signed-off-by: Ifta Khairul Alam Adil <ikaadil007@gmail.com>
Signed-off-by: Ifta khairul Alam Adil <25082512+ikaadil@users.noreply.github.com>
Committed by GitHub
parent 24a163ed77
commit a1d82466ea
@@ -133,9 +133,6 @@ class SchedulerConfig:
     async_scheduling: bool = Field(default=None)
     """If set to False, disable async scheduling. Async scheduling helps to
     avoid gaps in GPU utilization, leading to better latency and throughput.
-    It is currently not supported with some features such as
-    speculative decoding and pipeline parallelism, and will be automatically
-    disabled in those cases.
     """
 
     stream_interval: int = Field(default=1, ge=1)
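The `default=None` on `async_scheduling` suggests a three-state setting: unset, explicitly on, or explicitly off. The following is a minimal sketch of that pattern using only the standard library. `SchedulerConfigSketch`, `resolve_async_scheduling`, and `default_when_unset` are hypothetical names introduced for illustration; the diff does not show how the project resolves an unset value, so the fallback here is an assumption.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SchedulerConfigSketch:
    """Hypothetical stand-in for SchedulerConfig; not the project's class."""

    # None means "not set by the user"; True/False is an explicit choice.
    async_scheduling: Optional[bool] = None
    # Mirrors the constraint in Field(default=1, ge=1) from the diff.
    stream_interval: int = 1

    def __post_init__(self) -> None:
        # Reject values below 1, matching the ge=1 constraint.
        if self.stream_interval < 1:
            raise ValueError("stream_interval must be >= 1")


def resolve_async_scheduling(
    config: SchedulerConfigSketch, default_when_unset: bool
) -> bool:
    """Return the explicit setting if present, else the assumed fallback."""
    if config.async_scheduling is None:
        return default_when_unset
    return config.async_scheduling


if __name__ == "__main__":
    # Explicit False always wins, as the docstring describes.
    off = SchedulerConfigSketch(async_scheduling=False)
    print(resolve_async_scheduling(off, default_when_unset=True))
```

Keeping "unset" distinct from `False` lets a caller tell a deliberate opt-out from a missing value, which is what the docstring's "If set to False, disable async scheduling" wording implies.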