docs/contributing/model/README.md

# Summary

!!! important
    Many decoder language models can now be automatically loaded using the [Transformers backend][transformers-backend] without having to implement them in vLLM. See if `vllm serve <model>` works first!

vLLM models are specialized [PyTorch](https://pytorch.org/) models that take advantage of various [features](../../features/compatibility_matrix.md) to optimize their performance.

The complexity of integrating a model into vLLM depends heavily on the model's architecture.
The process is considerably straightforward if the model shares a similar architecture with an existing model in vLLM.
However, this can be more complex for models that include new operators (e.g., a new attention mechanism).

Read through these pages for a step-by-step guide:

- [Basic Model](basic.md)
- [Registering a Model](registration.md)
- [Unit Testing](tests.md)
- [Multi-Modal Support](multimodal.md)

!!! tip
    If you are encountering issues while integrating your model into vLLM, feel free to open a [GitHub issue](https://github.com/vllm-project/vllm/issues)
    or ask on our [developer slack](https://slack.vllm.ai).
    We will be happy to help you out!
Stop using title frontmatter and fix doc that can only be reached by search (#20623) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> 2025-07-08 11:27:40 +01:00			`# Summary`
Migrate docs from Sphinx to MkDocs (#18145) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> 2025-05-23 11:09:53 +02:00
[Doc] Update docs for New Model Implementation (#20115) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> 2025-06-26 15:47:06 +08:00			`!!! important`
			Many decoder language models can now be automatically loaded using the [Transformers backend][transformers-backend] without having to implement them in vLLM. See if `vllm serve <model>` works first!
Migrate docs from Sphinx to MkDocs (#18145) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> 2025-05-23 11:09:53 +02:00
Remove unnecessary explicit title anchors and use relative links instead (#20620) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> 2025-07-08 10:49:13 +01:00			`vLLM models are specialized [PyTorch](https://pytorch.org/) models that take advantage of various [features](../../features/compatibility_matrix.md) to optimize their performance.`
Migrate docs from Sphinx to MkDocs (#18145) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> 2025-05-23 11:09:53 +02:00
[Doc] Update docs for New Model Implementation (#20115) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> 2025-06-26 15:47:06 +08:00			`The complexity of integrating a model into vLLM depends heavily on the model's architecture.`
			`The process is considerably straightforward if the model shares a similar architecture with an existing model in vLLM.`
			`However, this can be more complex for models that include new operators (e.g., a new attention mechanism).`
Migrate docs from Sphinx to MkDocs (#18145) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> 2025-05-23 11:09:53 +02:00
[Doc] Update docs for New Model Implementation (#20115) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> 2025-06-26 15:47:06 +08:00			`Read through these pages for a step-by-step guide:`

[Doc] Rename page titles (#20130) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> 2025-06-26 23:18:49 +08:00			`- [Basic Model](basic.md)`
			`- [Registering a Model](registration.md)`
			`- [Unit Testing](tests.md)`
[Doc] Update docs for New Model Implementation (#20115) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> 2025-06-26 15:47:06 +08:00			`- [Multi-Modal Support](multimodal.md)`
Migrate docs from Sphinx to MkDocs (#18145) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> 2025-05-23 11:09:53 +02:00
			`!!! tip`
			`If you are encountering issues while integrating your model into vLLM, feel free to open a [GitHub issue](https://github.com/vllm-project/vllm/issues)`
			`or ask on our [developer slack](https://slack.vllm.ai).`
			`We will be happy to help you out!`