Files
vllm/tests/evals/gsm8k/configs/Nemotron-3-Super-120B-A12B-NVFP4.yaml
roikoren755 56777b5c89 [Test] E2E Nemotron-3-Super tests (#36803)
Signed-off-by: Roi Koren <roik@nvidia.com>
2026-03-23 17:49:56 -07:00

12 lines
332 B
YAML

model_name: "nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4"
accuracy_threshold: 0.93
num_questions: 1319
num_fewshot: 5
startup_max_wait_seconds: 1200
server_args: >-
--enforce-eager
--max-model-len 4096
--tensor-parallel-size 2
--enable-expert-parallel
--speculative-config '{"method":"mtp","num_speculative_tokens":5}'