Integrate Marlin Kernels for Int4 GPTQ inference (#2497)

Co-authored-by: Robert Shaw <114415538+rib-2@users.noreply.github.com>
Co-authored-by: alexm <alexm@neuralmagic.com>
This commit is contained in:
Robert Shaw
2024-03-01 14:47:51 -06:00
committed by GitHub
parent 90fbf12540
commit c0c2335ce0
12 changed files with 1752 additions and 6 deletions

File diff suppressed because it is too large Load Diff