Junyang Lin
2024-01-23 06:34:21 +08:00
committed by GitHub
parent ab7e6006d6
commit 94b5edeb53
6 changed files with 344 additions and 2 deletions

@@ -69,6 +69,9 @@ Alongside each architecture, we include some popular models that use it.
- Qwen
- :code:`Qwen/Qwen-7B`, :code:`Qwen/Qwen-7B-Chat`, etc.
* - :code:`Qwen2ForCausalLM`
- Qwen2
- :code:`Qwen/Qwen2-7B-beta`, :code:`Qwen/Qwen-7B-Chat-beta`, etc.
* - :code:`StableLMEpochForCausalLM`
- StableLM
- :code:`stabilityai/stablelm-3b-4e1t/` , :code:`stabilityai/stablelm-base-alpha-7b-v2`, etc.
* - :code:`YiForCausalLM`