QwenLM / Qwen2.5-Omni
View external linksLinks

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.
3,919Jun 12, 2025Updated 8 months ago

Alternatives and similar repositories for Qwen2.5-Omni

Users that are interested in Qwen2.5-Omni are comparing it to the libraries listed below

Sorting:

Are these results useful?