When do we not need larger vision models?
☆418Feb 8, 2025Updated last year
Alternatives and similar repositories for scaling_on_scales
Users that are interested in scaling_on_scales are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception☆158Dec 6, 2024Updated last year
- 【NeurIPS 2024】Dense Connector for MLLMs☆183Oct 14, 2024Updated last year
- ☆4,638Updated this week
- Cambrian-1 is a family of multimodal LLMs with a vision-centric design.☆1,993Nov 7, 2025Updated 5 months ago
- ☆126Jul 29, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.☆1,414Aug 4, 2025Updated 8 months ago
- [ICLR2025] LLaVA-HR: High-Resolution Large Language-Vision Assistant☆248Aug 14, 2024Updated last year
- Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model☆282Jun 25, 2024Updated last year
- EVE Series: Encoder-Free Vision-Language Models from BAAI☆368Jul 24, 2025Updated 8 months ago
- ☆360Jan 27, 2024Updated 2 years ago
- MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024)☆324Jan 20, 2025Updated last year
- LLaVA-UHD v3: Progressive Visual Compression for Efficient Native-Resolution Encoding in MLLMs☆419Dec 20, 2025Updated 3 months ago
- One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks☆4,032Updated this week
- A family of lightweight multimodal models.