To add a model directly to the vLLM library, start by forking our [GitHub repository](https://github.com/vllm-project/vllm) and then [building it from source](../../getting_started/installation/gpu.md#build-wheel-from-source).
After you have implemented your model (see [tutorial](basic.md)), put it into the [vllm/model_executor/models](../../../vllm/model_executor/models) directory.
Then, add your model class to `_VLLM_MODELS` in [vllm/model_executor/models/registry.py](../../../vllm/model_executor/models/registry.py) so that it is automatically registered upon importing vLLM.
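For illustration, each registry entry maps an architecture name (as reported by the model's Hugging Face config) to the module and class that implement it, so the class is only imported when that model is actually loaded. A minimal sketch, assuming a hypothetical file `your_model.py` defining `YourModelForCausalLM`; in the real file, entries live in per-category dicts that are merged into `_VLLM_MODELS`:

```python
# Sketch of a registry entry in vllm/model_executor/models/registry.py.
# The key is the architecture name from the model's Hugging Face config;
# the value names the module under vllm/model_executor/models and the
# class it defines. "your_model" and "YourModelForCausalLM" are
# placeholders for your own module and class.
_VLLM_MODELS = {
    # ... existing entries ...
    "YourModelForCausalLM": ("your_model", "YourModelForCausalLM"),
}
```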
If your model imports modules that initialize CUDA, consider lazy-importing it to avoid errors like `RuntimeError: Cannot re-initialize CUDA in forked subprocess`. One way to do this, sketched below with a hypothetical `your_code` module, is to register the model through `ModelRegistry` using a `"module:ClassName"` string rather than the class object itself:
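```python
# A sketch of lazy registration: passing the class path as a
# "module:ClassName" string defers the import (and any CUDA
# initialization) until the model is actually loaded in the worker
# process. "your_code" is a placeholder for your own module.
from vllm import ModelRegistry

ModelRegistry.register_model(
    "YourModelForCausalLM",
    "your_code:YourModelForCausalLM",
)
```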
If your model is multimodal, ensure the model class implements the [SupportsMultiModal][vllm.model_executor.models.interfaces.SupportsMultiModal] interface, as sketched below.
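A rough sketch of declaring the interface; the methods that `SupportsMultiModal` requires are covered in the multimodal tutorial and omitted here, and `YourModelForConditionalGeneration` is a placeholder name:

```python
import torch.nn as nn

from vllm.model_executor.models.interfaces import SupportsMultiModal

# Declaring the interface on the model class via multiple inheritance;
# the methods SupportsMultiModal requires (e.g. for computing multimodal
# embeddings) must also be implemented and are omitted from this sketch.
class YourModelForConditionalGeneration(nn.Module, SupportsMultiModal):
    ...
```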