[Model] Qwen2.5-Omni Cleanup (#16872)

This commit is contained in:
Roger Wang
2025-04-19 02:37:02 -07:00
committed by GitHub
parent 83f3c3bd91
commit 5124f5bf51
2 changed files with 2 additions and 5 deletions

View File

@@ -1117,8 +1117,8 @@ Our PaliGemma implementations have the same problem as Gemma 3 (see above) for b
:::
:::{note}
To use Qwen2.5-Omni, you have to install a fork of Hugging Face Transformers library from source via
`pip install git+https://github.com/BakerBunker/transformers.git@qwen25omni`.
To use Qwen2.5-Omni, you have to install Hugging Face Transformers library from source via
`pip install git+https://github.com/huggingface/transformers.git`.
Read audio from video pre-processing is currently supported on V0 (but not V1), because overlapping modalities is not yet supported in V1.
`--mm-processor-kwargs '{"use_audio_in_video": True}'`.