[V1] Extend beyond image modality and support mixed-modality inference with Llava-OneVision (#11685)
Signed-off-by: Roger Wang <ywang@roblox.com> Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>
This commit is contained in:
@@ -647,7 +647,7 @@ See [this page](#generative-models) for more information on how to use generativ
|
||||
- `llava-hf/llava-onevision-qwen2-7b-ov-hf`, `llava-hf/llava-onevision-qwen2-0.5b-ov-hf`, etc.
|
||||
-
|
||||
- ✅︎
|
||||
-
|
||||
- ✅︎
|
||||
* - `MiniCPMV`
|
||||
- MiniCPM-V
|
||||
- T + I<sup>E+</sup>
|
||||
|
||||
Reference in New Issue
Block a user