From e68de8adc0301babb3bb3fcd2ddccaf98e7695c8 Mon Sep 17 00:00:00 2001
From: Xiang Shi
Date: Fri, 6 Mar 2026 14:01:02 +0800
Subject: [PATCH] docs: fix wrong cc in int8.md (#36209)

Signed-off-by: Xiang Shi
---
 docs/features/quantization/int8.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/docs/features/quantization/int8.md b/docs/features/quantization/int8.md
index 18965aed3..53a5e7506 100644
--- a/docs/features/quantization/int8.md
+++ b/docs/features/quantization/int8.md
@@ -9,7 +9,7 @@ Please visit the HF collection of [quantized INT8 checkpoints of popular LLMs re
 INT8 computation is supported on NVIDIA GPUs with compute capability > 7.5 (Turing, Ampere, Ada Lovelace, Hopper).
 
 !!! warning
-    **Blackwell GPU Limitation**: INT8 is not supported on compute capability >= 100 (e.g., RTX 6000 Blackwell).
+    **Blackwell GPU Limitation**: INT8 is not supported on compute capability >= 10.0 (e.g., RTX 6000 Blackwell).
     Use [FP8 quantization](fp8.md) instead, or run on Hopper/Ada/Ampere architectures.
 
 ## Prerequisites