[Misc][Kernel]: Add GPTQAllSpark Quantization (#12931)
This commit is contained in:
1008
csrc/quantization/gptq_allspark/allspark_qgemm_w8a16.cu
Normal file
1008
csrc/quantization/gptq_allspark/allspark_qgemm_w8a16.cu
Normal file
File diff suppressed because it is too large
Load Diff
Reference in New Issue
Block a user