Add GPTQ Marlin 2:4 sparse structured support (#4790)
Co-authored-by: Robert Shaw <rshaw@neuralmagic.com>
This commit is contained in:
committed by
GitHub
parent
9216b9cc38
commit
6979ade384
1110
csrc/quantization/marlin/sparse/marlin_24_cuda_kernel.cu
Normal file
1110
csrc/quantization/marlin/sparse/marlin_24_cuda_kernel.cu
Normal file
File diff suppressed because it is too large
Load Diff
Reference in New Issue
Block a user