biondizzle/vllm
vllm/tests/quantization at commit 9d6a8daa87e2e0af3ff45d03d08ad5a94ec089a8
Latest commit: ee93f4f92a [CORE] Quantized lm-head Framework (#4442) by Qubitium-ModelCloud, 2024-07-02 22:25:17 +00:00
Co-authored-by: Robert Shaw <rshaw@neuralmagic.com>
Co-authored-by: ZX <zx@lbx.dev>
__init__.py                  [CI/Build] Move test_utils.py to tests/utils.py (#4425)                                             2024-05-13 23:50:09 +09:00
test_bitsandbytes.py         [CI/Build][REDO] Add is_quant_method_supported to control quantization test configurations (#5466)  2024-06-13 15:18:08 +00:00
test_compressed_tensors.py   [ Misc ] Refactor w8a8 to use process_weights_after_load (Simplify Weight Loading) (#5940)          2024-06-30 23:06:27 +00:00
test_configs.py              [mypy] Enable type checking for test directory (#5017)                                              2024-06-15 04:45:31 +00:00
test_fp8.py                  [ Misc ] Refactor w8a8 to use process_weights_after_load (Simplify Weight Loading) (#5940)          2024-06-30 23:06:27 +00:00
test_lm_head.py              [CORE] Quantized lm-head Framework (#4442)                                                           2024-07-02 22:25:17 +00:00
utils.py                     [misc][cuda] use nvml to avoid accidentally cuda initialization (#6007)                             2024-06-30 20:07:34 -07:00
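The commit messages above indicate how these tests are gated on hardware support: #5466 adds an is_quant_method_supported helper (kept in utils.py) that test modules use in skipif markers, and #6007 routes the GPU query through NVML so that collecting the tests does not accidentally initialize CUDA in the parent process. A minimal sketch of such a gate is shown below; the _MIN_CAPABILITY table, function bodies, and exact signatures are illustrative assumptions, not vLLM's actual implementation.

```python
# Hypothetical sketch of a hardware gate for quantization tests.
# Real helper: is_quant_method_supported in tests/quantization/utils.py (#5466);
# real capability thresholds come from each quantization config's
# get_min_capability() in vLLM, not from this table.
import pynvml
import pytest

# Assumed minimum compute capabilities per method (illustrative values only).
_MIN_CAPABILITY = {"fp8": 89, "bitsandbytes": 70, "compressed-tensors": 75}


def is_quant_method_supported(quant_method: str) -> bool:
    """Return True if the visible GPU can run the given quantization method.

    Uses NVML instead of torch.cuda so the check never initializes CUDA
    in the pytest parent process (the motivation behind #6007).
    """
    try:
        pynvml.nvmlInit()
    except pynvml.NVMLError:
        return False  # no NVIDIA driver / GPU available
    try:
        handle = pynvml.nvmlDeviceGetHandleByIndex(0)
        major, minor = pynvml.nvmlDeviceGetCudaComputeCapability(handle)
    finally:
        pynvml.nvmlShutdown()
    return major * 10 + minor >= _MIN_CAPABILITY[quant_method]


# Usage as in a test module such as test_fp8.py:
@pytest.mark.skipif(not is_quant_method_supported("fp8"),
                    reason="fp8 is not supported on this GPU type.")
def test_fp8_model_loads():
    ...
```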