[Docs] Move quant supported hardware table to README (#23663)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
2025-08-26 23:26:46 +01:00
parent 2f13319f47
commit 6421b66bf4
3 changed files with 48 additions and 34 deletions
--- a/docs/features/quantization/bitblas.md
+++ b/docs/features/quantization/bitblas.md
@@ -5,7 +5,7 @@ vLLM now supports [BitBLAS](https://github.com/microsoft/BitBLAS) for more effic
 !!! note
    Ensure your hardware supports the selected `dtype` (`torch.bfloat16` or `torch.float16`).
    Most recent NVIDIA GPUs support `float16`, while `bfloat16` is more common on newer architectures like Ampere or Hopper.
-    For details see [supported hardware](supported_hardware.md).
+    For details see [supported hardware](README.md#supported-hardware).

 Below are the steps to utilize BitBLAS with vLLM.