vllm/docs/source/features/quantization at 58d1b2aa772deb166355423997fbf5c1b6b186a1 - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Szymon Ożóg 7f0be2aa24 [Model] Deepseek GGUF support (#13167 )

2025-02-27 02:08:35 -08:00

..

auto_awq.md

[Doc] Remove performance warning for auto_awq.md (#12743 )

2025-02-04 22:43:11 -08:00

bnb.md

[CI/Build] Add markdown linter (#11857 )

2025-01-12 00:17:13 -08:00

fp8.md

[Doc] Convert docs to use colon fences (#12471 )

2025-01-29 11:38:29 +08:00

gguf.md

[Model] Deepseek GGUF support (#13167 )

2025-02-27 02:08:35 -08:00

index.md

[Doc] int4 w4a16 example (#12585 )

2025-01-31 15:38:48 -08:00

int4.md

[Doc] int4 w4a16 example (#12585 )

2025-01-31 15:38:48 -08:00

int8.md

[Doc] int4 w4a16 example (#12585 )

2025-01-31 15:38:48 -08:00

quantized_kvcache.md

[FP8][Kernel] Dynamic kv cache scaling factors computation (#11906 )

2025-01-23 18:04:03 +00:00

supported_hardware.md

[Doc]: Improve feature tables (#13224 )

2025-02-18 18:52:39 +08:00