[ Kernel ] Enable fp8-marlin for fbgemm-fp8 models (#6606)

This commit is contained in:
Robert Shaw
2024-07-20 14:50:10 -04:00
committed by GitHub
parent 06d6c5fe9f
commit 9364f74eee
4 changed files with 44 additions and 3 deletions

View File

@@ -1,3 +1,4 @@
Meta-Llama-3-70B-Instruct-FBGEMM-nonuniform.yaml
Meta-Llama-3-70B-Instruct.yaml
Mixtral-8x7B-Instruct-v0.1.yaml
Qwen2-57B-A14-Instruct.yaml