docs/contributing/model/registration.md

---
title: Registering a Model
---
[](){ #new-model-registration }

vLLM relies on a model registry to determine how to run each model.
A list of pre-registered architectures can be found [here][supported-models].

If your model is not on this list, you must register it to vLLM.
This page provides detailed instructions on how to do so.

## Built-in models

To add a model directly to the vLLM library, start by forking our [GitHub repository](https://github.com/vllm-project/vllm) and then [build it from source][build-from-source].
This gives you the ability to modify the codebase and test your model.

After you have implemented your model (see [tutorial][new-model-basic]), put it into the <gh-dir:vllm/model_executor/models> directory.
Then, add your model class to `_VLLM_MODELS` in <gh-file:vllm/model_executor/models/registry.py> so that it is automatically registered upon importing vLLM.
Finally, update our [list of supported models][supported-models] to promote your model!

!!! important
    The list of models in each section should be maintained in alphabetical order.

## Out-of-tree models

You can load an external model [using a plugin][plugin-system] without modifying the vLLM codebase.

To register the model, use the following code:

```python
# The entrypoint of your plugin
def register():
    from vllm import ModelRegistry
    from your_code import YourModelForCausalLM

    ModelRegistry.register_model("YourModelForCausalLM", YourModelForCausalLM)
```

If your model imports modules that initialize CUDA, consider lazy-importing it to avoid errors like `RuntimeError: Cannot re-initialize CUDA in forked subprocess`:

```python
# The entrypoint of your plugin
def register():
    from vllm import ModelRegistry

    ModelRegistry.register_model(
        "YourModelForCausalLM",
        "your_code:YourModelForCausalLM"
    )
```

!!! important
    If your model is a multimodal model, ensure the model class implements the [SupportsMultiModal][vllm.model_executor.models.interfaces.SupportsMultiModal] interface.
    Read more about that [here][supports-multimodal].
Migrate docs from Sphinx to MkDocs (#18145) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> 2025-05-23 11:09:53 +02:00			`---`
[Doc] Rename page titles (#20130) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> 2025-06-26 23:18:49 +08:00			`title: Registering a Model`
Migrate docs from Sphinx to MkDocs (#18145) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> 2025-05-23 11:09:53 +02:00			`---`
			`[](){ #new-model-registration }`
[Doc][2/N] Reorganize Models and Usage sections (#11755) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> 2025-01-06 21:40:31 +08:00
			`vLLM relies on a model registry to determine how to run each model.`
Migrate docs from Sphinx to MkDocs (#18145) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> 2025-05-23 11:09:53 +02:00			`A list of pre-registered architectures can be found [here][supported-models].`
[Doc][2/N] Reorganize Models and Usage sections (#11755) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> 2025-01-06 21:40:31 +08:00
			`If your model is not on this list, you must register it to vLLM.`
			`This page provides detailed instructions on how to do so.`

			`## Built-in models`

Migrate docs from Sphinx to MkDocs (#18145) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> 2025-05-23 11:09:53 +02:00			`To add a model directly to the vLLM library, start by forking our [GitHub repository](https://github.com/vllm-project/vllm) and then [build it from source][build-from-source].`
[Doc][2/N] Reorganize Models and Usage sections (#11755) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> 2025-01-06 21:40:31 +08:00			`This gives you the ability to modify the codebase and test your model.`

Migrate docs from Sphinx to MkDocs (#18145) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> 2025-05-23 11:09:53 +02:00			`After you have implemented your model (see [tutorial][new-model-basic]), put it into the <gh-dir:vllm/model_executor/models> directory.`
[Doc][2/N] Reorganize Models and Usage sections (#11755) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> 2025-01-06 21:40:31 +08:00			Then, add your model class to `_VLLM_MODELS` in <gh-file:vllm/model_executor/models/registry.py> so that it is automatically registered upon importing vLLM.
Migrate docs from Sphinx to MkDocs (#18145) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> 2025-05-23 11:09:53 +02:00			`Finally, update our [list of supported models][supported-models] to promote your model!`
[Doc][2/N] Reorganize Models and Usage sections (#11755) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> 2025-01-06 21:40:31 +08:00
[Doc] Support "important" and "announcement" admonitions (#19479) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> 2025-06-11 16:39:58 +08:00			`!!! important`
Migrate docs from Sphinx to MkDocs (#18145) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> 2025-05-23 11:09:53 +02:00			`The list of models in each section should be maintained in alphabetical order.`
[Doc][2/N] Reorganize Models and Usage sections (#11755) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> 2025-01-06 21:40:31 +08:00
			`## Out-of-tree models`

[Doc] Update OOT model docs (#18742) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> 2025-05-27 14:30:31 +08:00			`You can load an external model [using a plugin][plugin-system] without modifying the vLLM codebase.`
[Doc][2/N] Reorganize Models and Usage sections (#11755) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> 2025-01-06 21:40:31 +08:00
			`To register the model, use the following code:`

			```python
[Doc] Update OOT model docs (#18742) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> 2025-05-27 14:30:31 +08:00			`# The entrypoint of your plugin`
			`def register():`
			`from vllm import ModelRegistry`
			`from your_code import YourModelForCausalLM`

			`ModelRegistry.register_model("YourModelForCausalLM", YourModelForCausalLM)`
[Doc][2/N] Reorganize Models and Usage sections (#11755) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> 2025-01-06 21:40:31 +08:00			```

			If your model imports modules that initialize CUDA, consider lazy-importing it to avoid errors like `RuntimeError: Cannot re-initialize CUDA in forked subprocess`:

			```python
[Doc] Update OOT model docs (#18742) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> 2025-05-27 14:30:31 +08:00			`# The entrypoint of your plugin`
			`def register():`
			`from vllm import ModelRegistry`

			`ModelRegistry.register_model(`
			`"YourModelForCausalLM",`
			`"your_code:YourModelForCausalLM"`
			`)`
[Doc][2/N] Reorganize Models and Usage sections (#11755) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> 2025-01-06 21:40:31 +08:00			```

[Doc] Support "important" and "announcement" admonitions (#19479) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> 2025-06-11 16:39:58 +08:00			`!!! important`
Migrate docs from Sphinx to MkDocs (#18145) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> 2025-05-23 11:09:53 +02:00			`If your model is a multimodal model, ensure the model class implements the [SupportsMultiModal][vllm.model_executor.models.interfaces.SupportsMultiModal] interface.`
			`Read more about that [here][supports-multimodal].`