Ola: Pushing the Frontiers of Omni-Modal Language Model
☆387Jun 13, 2025Updated 8 months ago
Alternatives and similar repositories for Ola
Users that are interested in Ola are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] MLLM for On-Demand Spatial-Temporal Understanding at Arbitrary Resolution☆331Jul 4, 2025Updated 8 months ago
- ✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction☆2,494Mar 28, 2025Updated 11 months ago
- [CVPR2025 Highlight] Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models☆235Nov 7, 2025Updated 4 months ago
- Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos☆69Sep 5, 2025Updated 6 months ago
- DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception