Remove unnecessary explicit title anchors and use relative links instead (#20620)
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
This commit is contained in:
@@ -1,7 +1,6 @@
|
||||
---
|
||||
title: Offline Inference
|
||||
---
|
||||
[](){ #offline-inference }
|
||||
|
||||
Offline inference is possible in your own code using vLLM's [`LLM`][vllm.LLM] class.
|
||||
|
||||
@@ -18,8 +17,8 @@ llm = LLM(model="facebook/opt-125m")
|
||||
After initializing the `LLM` instance, use the available APIs to perform model inference.
|
||||
The available APIs depend on the model type:
|
||||
|
||||
- [Generative models][generative-models] output logprobs which are sampled from to obtain the final output text.
|
||||
- [Pooling models][pooling-models] output their hidden states directly.
|
||||
- [Generative models](../models/generative_models.md) output logprobs which are sampled from to obtain the final output text.
|
||||
- [Pooling models](../models/pooling_models.md) output their hidden states directly.
|
||||
|
||||
!!! info
|
||||
[API Reference][offline-inference-api]
|
||||
|
||||
Reference in New Issue
Block a user