[CI] Bump num_speculative_tokens to 3 in nightly DeepSeek tests (#35882)

Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
This commit is contained in:
Matthew Bonanni
2026-03-03 12:26:44 -05:00
committed by GitHub
parent ae88468bcc
commit 8e1fd5baf0
4 changed files with 4 additions and 4 deletions

View File

@@ -8,4 +8,4 @@ server_args: >-
--max-model-len 4096
--tensor-parallel-size 8
--enable-expert-parallel
--speculative-config '{"method":"mtp","num_speculative_tokens":1}'
--speculative-config '{"method":"mtp","num_speculative_tokens":3}'