BAAI-DCAI / Bunny
A family of lightweight multimodal models.
☆972Updated last month
Alternatives and similar repositories for Bunny:
Users that are interested in Bunny are comparing it to the libraries listed below
- A Framework of Small-scale Large Multimodal Models☆709Updated 3 weeks ago
- Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks☆1,689Updated this week
- LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills☆720Updated 11 months ago
- LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer☆348Updated this week
- LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)☆758Updated 5 months ago
- Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation