[Doc]: fix typos in Python comments (#24294)
Signed-off-by: Didier Durand <durand.didier@gmail.com>
Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
@@ -687,7 +687,7 @@ class FlashInferImpl(AttentionImpl):
         else:
             raise ValueError(f"Unsupported output dtype: {output.dtype}")

-        # TRTLLM attn kernel requires to scale to pass as a host scalar,
+        # TRTLLM attn kernel requires o scale to pass as a host scalar,
         # store the o scale as a host scalar in warmup run with cuda graph
         # not enabled
         if layer._o_scale_float is None:
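The corrected comment describes a common pattern: a kernel captured in a CUDA graph cannot recompute host-side values on every call, so the output scale is materialized once as a host scalar during the graph-free warmup run and reused afterwards. Below is a minimal, hypothetical sketch of that caching pattern; the class and attribute names (other than `_o_scale_float`, which appears in the diff) are illustrative assumptions, not vLLM's actual implementation.

```python
class AttentionLayerSketch:
    """Hypothetical sketch: cache a derived scale as a host scalar.

    A CUDA-graph-captured kernel cannot re-run host-side Python code per
    call, so the scalar is computed once during the warmup run (before
    graph capture) and the cached value is reused on every later call.
    """

    def __init__(self, q_scale: float, prob_scale: float):
        self._q_scale = q_scale
        self._prob_scale = prob_scale
        # Host-side cache; None until the warmup run fills it in.
        self._o_scale_float = None

    def forward(self, attn_out: float) -> float:
        # Warmup path: compute the host scalar once, while no CUDA
        # graph is capturing (illustrative formula, not vLLM's).
        if self._o_scale_float is None:
            self._o_scale_float = self._q_scale * self._prob_scale
        # Steady-state path: pass the cached host scalar to the kernel.
        return attn_out * self._o_scale_float
```

The key point is the `is None` guard: the expensive or graph-incompatible host computation happens exactly once, which is why the original comment's wording ("o scale", the output scale, as a host scalar) matters for readers of the warmup logic.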