vllm/docs/source/features/quantization at 2549c0dfef2f897d19eb8aa4294b3b8419ce078d - vllm

Files

Tristan Leclercq 4285e423a6 [Misc] Auto detect bitsandbytes pre-quantized models (#16027 )

Signed-off-by: Tristan Leclercq <tristanleclercq@gmail.com>

2025-04-04 23:30:45 -07:00

auto_awq.md

2025-03-03 21:59:09 +00:00

bnb.md

2025-04-04 23:30:45 -07:00

fp8.md

2025-01-29 11:38:29 +08:00

gguf.md

2025-02-27 02:08:35 -08:00

gptqmodel.md

2025-03-03 21:59:09 +00:00

index.md

2025-04-01 08:32:45 -07:00

int4.md

2025-01-31 15:38:48 -08:00

int8.md

2025-01-31 15:38:48 -08:00

quantized_kvcache.md

2025-01-23 18:04:03 +00:00

quark.md

2025-04-01 08:32:45 -07:00

supported_hardware.md

2025-02-18 18:52:39 +08:00