Remove unnecessary explicit title anchors and use relative links instead (#20620)
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
This commit is contained in:
@@ -1,7 +1,6 @@
|
||||
---
|
||||
title: Installation
|
||||
---
|
||||
[](){ #installation-index }
|
||||
|
||||
vLLM supports the following hardware platforms:
|
||||
|
||||
|
||||
@@ -109,8 +109,8 @@ docker run \
|
||||
|
||||
### Supported features
|
||||
|
||||
- [Offline inference][offline-inference]
|
||||
- Online serving via [OpenAI-Compatible Server][serving-openai-compatible-server]
|
||||
- [Offline inference](../../serving/offline_inference.md)
|
||||
- Online serving via [OpenAI-Compatible Server](../../serving/openai_compatible_server.md)
|
||||
- HPU autodetection - no need to manually select device within vLLM
|
||||
- Paged KV cache with algorithms enabled for Intel Gaudi accelerators
|
||||
- Custom Intel Gaudi implementations of Paged Attention, KV cache ops,
|
||||
|
||||
@@ -1,7 +1,6 @@
|
||||
---
|
||||
title: Quickstart
|
||||
---
|
||||
[](){ #quickstart }
|
||||
|
||||
This guide will help you quickly get started with vLLM to perform:
|
||||
|
||||
@@ -43,7 +42,7 @@ uv pip install vllm --torch-backend=auto
|
||||
```
|
||||
|
||||
!!! note
|
||||
For more detail and non-CUDA platforms, please refer [here][installation-index] for specific instructions on how to install vLLM.
|
||||
For more detail and non-CUDA platforms, please refer [here](installation/README.md) for specific instructions on how to install vLLM.
|
||||
|
||||
[](){ #quickstart-offline }
|
||||
|
||||
@@ -77,7 +76,7 @@ prompts = [
|
||||
sampling_params = SamplingParams(temperature=0.8, top_p=0.95)
|
||||
```
|
||||
|
||||
The [LLM][vllm.LLM] class initializes vLLM's engine and the [OPT-125M model](https://arxiv.org/abs/2205.01068) for offline inference. The list of supported models can be found [here][supported-models].
|
||||
The [LLM][vllm.LLM] class initializes vLLM's engine and the [OPT-125M model](https://arxiv.org/abs/2205.01068) for offline inference. The list of supported models can be found [here](../models/supported_models.md).
|
||||
|
||||
```python
|
||||
llm = LLM(model="facebook/opt-125m")
|
||||
|
||||
Reference in New Issue
Block a user