Signed-off-by: jiahanc <173873397+jiahanc@users.noreply.github.com> Signed-off-by: Robert Shaw <robshaw@redhat.com> Co-authored-by: Robert Shaw <robshaw@redhat.com>
13 lines
546 B
Plaintext
13 lines
546 B
Plaintext
Qwen3-30B-A3B-NvFp4-CT-fi-cutedsl-deepep-ll.yaml
|
|
Qwen3-30B-A3B-NvFp4-ModelOpt-fi-cutedsl-deepep-ll.yaml
|
|
Qwen3-30B-A3B-NvFp4-CT-fi-cutlass.yaml
|
|
Qwen3-30B-A3B-NvFp4-ModelOpt-fi-trtllm.yaml
|
|
Qwen3-30B-A3B-NvFp4-ModelOpt-fi-cutlass.yaml
|
|
Qwen3-30B-A3B-Fp8-AutoFp8-deepgemm-deepep-ht.yaml
|
|
Qwen3-30B-A3B-Fp8-AutoFp8-deepgemm-deepep-ll.yaml
|
|
Qwen3-30B-A3B-Fp8-AutoFp8-deepgemm.yaml
|
|
Qwen3-30B-A3B-Fp8-CT-Block-deepgemm-deepep-ht.yaml
|
|
Qwen3-30B-A3B-Fp8-CT-Block-deepgemm-deepep-ll.yaml
|
|
Qwen3-30B-A3B-Fp8-CT-Block-deepgemm.yaml
|
|
Qwen3-30B-A3B-BF16-triton.yaml
|