vllm/csrc/quantization at 3a922c1e7ee6753f41c6cc9d6d47d3b2d0110447 - vllm

Files

Robert Shaw 73c8d677e5 [Kernel] Marlin Expansion: Support AutoGPTQ Models with Marlin (#3922 )

Co-authored-by: alexm <alexm@neuralmagic.com>
Co-authored-by: mgoin <michael@neuralmagic.com>

2024-04-29 09:35:34 -07:00

AQLM CUDA support (#3287 )

2024-04-23 13:59:33 -04:00

2024-02-12 11:02:17 -08:00

2024-04-27 04:49:59 +00:00

2024-02-01 09:35:09 -08:00

2024-04-11 16:35:51 -04:00

2024-04-29 09:35:34 -07:00

2024-04-24 10:35:01 -07:00

2024-01-03 09:52:29 -08:00