[Bugfix] Disable gptq_bitblas for <SM80 to fix GPTQ on V100/T4 (#17541)

Signed-off-by: mgoin <mgoin64@gmail.com>
Michael Goin
2025-05-01 18:59:35 -06:00
committed by GitHub
parent 39c0813a7f
commit 24aebae177

@@ -134,7 +134,7 @@ class GPTQBitBLASConfig(QuantizationConfig):
     @classmethod
     def get_min_capability(cls) -> int:
-        return 70
+        return 80
     @classmethod
     def get_config_filenames(cls) -> List[str]:
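
For illustration only: a minimal, self-contained sketch of how a minimum-capability threshold like the one changed above can gate a quantization method by GPU generation. `Capability`, `GPUS`, and `supported_gpus` are hypothetical names invented for this example and are not vLLM's actual API. The compute capabilities assumed are V100 = SM 7.0, T4 = SM 7.5, and A100 = SM 8.0.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Capability:
    """Hypothetical stand-in for a CUDA compute capability (major.minor)."""
    major: int
    minor: int

    def to_int(self) -> int:
        # Encode SM 7.0 as 70, SM 7.5 as 75, SM 8.0 as 80, matching the
        # integer convention returned by get_min_capability() in the diff.
        return self.major * 10 + self.minor


# Assumed example devices; extend as needed.
GPUS: Dict[str, Capability] = {
    "V100": Capability(7, 0),
    "T4": Capability(7, 5),
    "A100": Capability(8, 0),
}


def supported_gpus(min_capability: int) -> List[str]:
    """Return the example GPUs whose capability meets the threshold."""
    return [
        name for name, cap in GPUS.items()
        if cap.to_int() >= min_capability
    ]


if __name__ == "__main__":
    print("min capability 70:", supported_gpus(70))
    print("min capability 80:", supported_gpus(80))
```

Under these assumptions, a threshold of 70 admits all three example devices, while a threshold of 80 excludes V100 and T4, which is the effect the commit title describes.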