[Docs] Switch to better markdown linting pre-commit hook (#21851)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Author: Harry Mellor
Date: 2025-07-30 03:45:08 +01:00
Committed by: GitHub
Parent: 555e7225bc
Commit: ba5c5e5404
55 changed files with 273 additions and 198 deletions


@@ -148,7 +148,7 @@ are not yet supported.
vLLM V1 supports logprobs and prompt logprobs. However, there are some important semantic
differences compared to V0:
-**Logprobs Calculation**
+##### Logprobs Calculation
Logprobs in V1 are now returned immediately once computed from the model's raw output (i.e.
before applying any logits post-processing such as temperature scaling or penalty
@@ -157,7 +157,7 @@ probabilities used during sampling.
Support for logprobs with post-sampling adjustments is in progress and will be added in future updates.
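For context, a minimal sketch of requesting logprobs with the offline API (the model name and top-k values here are illustrative, not part of this change):

```python
from vllm import LLM, SamplingParams

llm = LLM(model="facebook/opt-125m")  # illustrative model
params = SamplingParams(
    temperature=0.8,
    logprobs=5,         # top-5 logprobs for each generated token
    prompt_logprobs=5,  # top-5 logprobs for each prompt token
)
outputs = llm.generate(["The capital of France is"], params)
# In V1 these values reflect the raw model output, i.e. before
# temperature scaling or penalty adjustments are applied.
print(outputs[0].outputs[0].logprobs)
```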
-**Prompt Logprobs with Prefix Caching**
+##### Prompt Logprobs with Prefix Caching
Currently, prompt logprobs are only supported when prefix caching is turned off via `--no-enable-prefix-caching`. In a future release, prompt logprobs will be compatible with prefix caching, but a recomputation will be triggered to recover the full prompt logprobs even upon a prefix cache hit. See details in [RFC #13414](gh-issue:13414).
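A minimal sketch of a request that satisfies this constraint (the model name is illustrative; `enable_prefix_caching=False` is the offline equivalent of the CLI flag):

```python
from vllm import LLM, SamplingParams

# Offline equivalent of passing --no-enable-prefix-caching to vllm serve.
llm = LLM(model="facebook/opt-125m", enable_prefix_caching=False)
params = SamplingParams(prompt_logprobs=5, max_tokens=1)
outputs = llm.generate(["San Francisco is a"], params)
print(outputs[0].prompt_logprobs)  # one entry per prompt token
```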
@@ -165,7 +165,7 @@ Currently prompt logprobs are only supported when prefix caching is turned off v
As part of the major architectural rework in vLLM V1, several legacy features have been deprecated.
-**Sampling features**
+##### Sampling features
- **best_of**: This feature has been deprecated due to limited usage. See details at [RFC #13361](gh-issue:13361).
- **Per-Request Logits Processors**: In V0, users could pass custom
@@ -173,11 +173,11 @@ As part of the major architectural rework in vLLM V1, several legacy features ha
feature has been deprecated. Instead, the design is moving toward supporting **global logits
processors**, a feature the team is actively working on for future releases; a sketch of the deprecated per-request pattern follows this list. See details at [RFC #13360](gh-pr:13360).
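For reference, a minimal sketch of the deprecated V0-style pattern (the processor body and banned token id are illustrative):

```python
import torch
from vllm import SamplingParams

def ban_token_42(token_ids: list[int], logits: torch.Tensor) -> torch.Tensor:
    # Suppress token id 42 for this request only (illustrative).
    logits[42] = float("-inf")
    return logits

# Accepted in V0; deprecated in V1 pending global logits processors.
params = SamplingParams(logits_processors=[ban_token_42])
```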
-**KV Cache features**
+##### KV Cache features
- **GPU <> CPU KV Cache Swapping**: with the new simplified core architecture, vLLM V1 no longer requires KV cache swapping
to handle request preemptions.
-**Structured Output features**
+##### Structured Output features
- **Request-level Structured Output Backend**: Deprecated; alternative backends (outlines, guidance) with fallbacks are now supported, as sketched below.
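A minimal sketch of the engine-level selection this implies (the model name is illustrative, and passing `guided_decoding_backend="auto"` at engine construction is an assumption based on current engine arguments; "auto" allows fallback between backends):

```python
from vllm import LLM, SamplingParams
from vllm.sampling_params import GuidedDecodingParams

# The backend is chosen engine-wide rather than per request; "auto"
# permits fallback between backends such as outlines and guidance.
llm = LLM(model="facebook/opt-125m", guided_decoding_backend="auto")
params = SamplingParams(
    guided_decoding=GuidedDecodingParams(choice=["yes", "no"])
)
outputs = llm.generate(["Is the sky blue? Answer:"], params)
print(outputs[0].outputs[0].text)
```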