[Model] Support InternLM2 Reward models (#11571)
Signed-off-by: Isotr0py <2037008807@qq.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
@@ -450,6 +450,11 @@ of the whole prompt are extracted from the normalized hidden state corresponding
     - Example HF Models
     - :ref:`LoRA <lora-adapter>`
     - :ref:`PP <distributed-serving>`
+  * - :code:`InternLM2ForRewardModel`
+    - InternLM2-based
+    - :code:`internlm/internlm2-1_8b-reward`, :code:`internlm/internlm2-7b-reward`, etc.
+    - ✅︎
+    - ✅︎
   * - :code:`LlamaForCausalLM`
     - Llama-based
     - :code:`peiyi9979/math-shepherd-mistral-7b-prm`, etc.