
Harry Mellor b4bab81660 Remove unnecessary explicit title anchors and use relative links instead (#20620 )

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

2025-07-08 02:49:13 -07:00

NVIDIA Triton

The Triton Inference Server project hosts a tutorial that shows how to quickly deploy a small `facebook/opt-125m` model using vLLM. See "Deploying a vLLM model in Triton" in that tutorial for details.
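Once a model is deployed as the tutorial describes, a client can query it over HTTP. The following is a minimal sketch, not part of the tutorial itself. It assumes that Triton's HTTP endpoint listens on `localhost:8000`, that the model is registered under the name `vllm_model`, and that the server exposes the `generate` endpoint with `text_input` and `text_output` fields. All of these are assumptions to verify against your own deployment, and `build_generate_request` and `generate` are hypothetical helper names.

```python
import json
import urllib.request


def build_generate_request(prompt, model_name="vllm_model",
                           host="localhost", port=8000, temperature=0.0):
    """Build an HTTP POST request for Triton's generate endpoint.

    Assumptions (check against your deployment): the model name, host,
    port, and the `text_input` / `parameters` payload layout.
    """
    url = f"http://{host}:{port}/v2/models/{model_name}/generate"
    payload = {
        "text_input": prompt,
        "parameters": {"stream": False, "temperature": temperature},
    }
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt, **kwargs):
    """Send the prompt to the server and return the generated text.

    Assumes the JSON response carries the completion in `text_output`.
    """
    request = build_generate_request(prompt, **kwargs)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.loads(response.read())["text_output"]
```

With a server running, `generate("What is Triton Inference Server?")` would return the model's completion as a string. Building the request separately from sending it makes the payload easy to inspect without a live server.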