Add and list supported models in README (#161)

This commit is contained in:
Zhuohan Li
2023-06-20 10:57:46 +08:00
committed by GitHub
parent 570fb2e9cc
commit 0b32a987dd
3 changed files with 15 additions and 1 deletion


@@ -39,6 +39,13 @@ vLLM is flexible and easy to use with:
- Streaming outputs
- OpenAI-compatible API server
vLLM seamlessly supports many Hugging Face models, including the following architectures:
- GPT-2 (e.g., `gpt2`, `gpt2-xl`, etc.)
- GPTNeoX (e.g., `EleutherAI/gpt-neox-20b`, `databricks/dolly-v2-12b`, `stabilityai/stablelm-tuned-alpha-7b`, etc.)
- LLaMA (e.g., `lmsys/vicuna-13b-v1.3`, `young-geng/koala`, `openlm-research/open_llama_13b`, etc.)
- OPT (e.g., `facebook/opt-66b`, `facebook/opt-iml-max-30b`, etc.)
Install vLLM with pip or [from source](https://llm-serving-cacheflow.readthedocs-hosted.com/en/latest/getting_started/installation.html#build-from-source):
```bash