[Model] Support InternLM2 Reward models (#11571)

Signed-off-by: Isotr0py <2037008807@qq.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
This commit is contained in:
Isotr0py
2024-12-28 14:14:10 +08:00
committed by GitHub
parent b5cbe8eeb3
commit d34be24bb1
4 changed files with 67 additions and 1 deletions

View File

@@ -450,6 +450,11 @@ of the whole prompt are extracted from the normalized hidden state corresponding
- Example HF Models
- :ref:`LoRA <lora-adapter>`
- :ref:`PP <distributed-serving>`
* - :code:`InternLM2ForRewardModel`
- InternLM2-based
- :code:`internlm/internlm2-1_8b-reward`, :code:`internlm/internlm2-7b-reward`, etc.
- ✅︎
- ✅︎
* - :code:`LlamaForCausalLM`
- Llama-based
- :code:`peiyi9979/math-shepherd-mistral-7b-prm`, etc.