[Model] Add OLMoE (#7922)

This commit is contained in:
Niklas Muennighoff
2024-09-20 09:31:41 -07:00
committed by GitHub
parent 260d40b5ea
commit 3b63de9353
3 changed files with 414 additions and 0 deletions

View File

@@ -127,6 +127,10 @@ Decoder-only Language Models
- Nemotron-3, Nemotron-4, Minitron
- :code:`nvidia/Minitron-8B-Base`, :code:`mgoin/Nemotron-4-340B-Base-hf-FP8`, etc.
- ✅︎
* - :code:`OLMoEForCausalLM`
- OLMoE
- :code:`allenai/OLMoE-1B-7B-0924`, :code:`allenai/OLMoE-1B-7B-0924-Instruct`, etc.
-
* - :code:`OLMoForCausalLM`
- OLMo
- :code:`allenai/OLMo-1B-hf`, :code:`allenai/OLMo-7B-hf`, etc.