Signed-off-by: Dipika Sikka <dipikasikka1@gmail.com> Co-authored-by: Michael Goin <mgoin64@gmail.com>
7 lines
222 B
Plaintext
7 lines
222 B
Plaintext
Qwen3-0.6B-FP8.yaml
|
|
Llama-3.2-1B-Instruct-INT8-CT.yaml
|
|
Llama-3-8B-Instruct-nonuniform-CT.yaml
|
|
Qwen2.5-VL-3B-Instruct-FP8-dynamic.yaml
|
|
Qwen1.5-MoE-W4A16-CT.yaml
|
|
DeepSeek-V2-Lite-Instruct-FP8.yaml
|
|
Qwen3-30B-A3B-MXFP4A16.yaml |