[Model][VLM] Support multi-images inputs for InternVL2 models (#8201)

2024-09-07 16:38:23 +08:00
parent 9f68e00d27
commit e807125936
5 changed files with 199 additions and 57 deletions
--- a/docs/source/models/supported_models.rst
+++ b/docs/source/models/supported_models.rst
@@ -214,7 +214,7 @@ Multimodal Language Models
    - 
  * - :code:`InternVLChatModel`
    - InternVL2
-    - Image\ :sup:`E`
+    - Image\ :sup:`E+`
    - :code:`OpenGVLab/InternVL2-4B`, :code:`OpenGVLab/InternVL2-8B`, etc.
    - 
  * - :code:`LlavaForConditionalGeneration`