update doc for online fp8 quantization (#37851)

Signed-off-by: Yan Ma <yan.ma@intel.com>
2026-03-23 13:19:03 +08:00
parent f85e479e66
commit d3fe857135
1 changed files with 0 additions and 3 deletions
--- a/docs/features/quantization/fp8.md
+++ b/docs/features/quantization/fp8.md
@@ -137,6 +137,3 @@ llm = LLM("facebook/opt-125m", quantization="fp8")
 result = llm.generate("Hello, my name is")
 print(result[0].outputs[0].text)
 ```
-
-!!! warning
-    Currently, we load the model at original precision before quantizing down to 8-bits, so you need enough memory to load the whole model.