[Misc] Update qqq to use vLLMParameters (#7805)

This commit is contained in:
Dipika Sikka
2024-08-26 15:16:15 -04:00
committed by GitHub
parent 2deb029d11
commit 665304092d
3 changed files with 54 additions and 64 deletions

View File

@@ -17,4 +17,6 @@ awq, casperhansen/mixtral-instruct-awq, main
awq_marlin, casperhansen/mixtral-instruct-awq, main
fp8, neuralmagic/Meta-Llama-3-8B-Instruct-FP8-KV, main
marlin, nm-testing/zephyr-beta-7b-marlin-g128, main
marlin, robertgshaw2/zephyr-7b-beta-channelwise-marlin, main
marlin, robertgshaw2/zephyr-7b-beta-channelwise-marlin, main
qqq, HandH1998/QQQ-Llama-3-8b-g128, main
qqq, HandH1998/QQQ-Llama-3-8b, main