Luodian / Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

☆3,215

Alternatives and similar repositories for Otter:

Users that are interested in Otter are comparing it to the libraries listed below

thunlp / UltraChat
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
☆2,150Updated 10 months ago
OpenBMB / ToolBench
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
☆4,848Updated 2 months ago
lyuchenyang / Macaw-LLM
Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration
☆1,518Updated 2 weeks ago
EvolvingLMMs-Lab / lmms-eval
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
☆1,987Updated this week
OpenBMB / BMTools
Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
☆2,756Updated last year
Yuliang-Liu / Monkey
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
☆1,683Updated 2 weeks ago
mlfoundations / open_flamingo
An open-source framework for training large multimodal models.
☆3,801Updated 4 months ago
showlab / Show-1
[IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
☆927Updated 2 months ago
Alpha-VLLM / LLaMA2-Accessory
An Open-source Toolkit for LLM Development
☆2,747Updated this week
microsoft / i-Code
☆1,682Updated 3 months ago
X-PLUG / mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
☆2,393Updated last month
OpenGVLab / Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
☆3,141Updated last month
open-mmlab / Multimodal-GPT
Multimodal-GPT
☆1,488Updated last year
HITsz-TMG / UMOE-Scaling-Unified-Multimodal-LLMs
The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"
☆796Updated last week
MasterBin-IIAU / UNINEXT
[CVPR'23] Universal Instance Perception as Object Discovery and Retrieval
☆1,252Updated last year
dvlab-research / LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
☆1,949Updated 2 weeks ago
dvlab-research / LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
☆2,635Updated 5 months ago
FreedomIntelligence / LLMZoo
⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡
☆2,923Updated last year
baaivision / Emu
Emu Series: Generative Multimodal Models from BAAI
☆1,673Updated 3 months ago
OpenBMB / AgentVerse
🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides …
☆4,292Updated 4 months ago
lxtGH / OMG-Seg
OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]
☆1,199Updated last month
OpenBMB / CPM-Bee
百亿参数的中英文双语基座大模型
☆2,423Updated last year
OpenGVLab / LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
☆5,799Updated 10 months ago
Docta-ai / docta
A Doctor for your data
☆2,416Updated this week
YiVal / YiVal
Your Automatic Prompt Engineering Assistant for GenAI Applications
☆2,082Updated 8 months ago
AlaaLab / InstructCV
[ ICLR 2024 ] Official Codebase for "InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists"
☆462Updated 8 months ago
OptimalScale / DetGPT
☆762Updated 5 months ago
InternLM / InternLM-XComposer
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
☆2,709Updated 3 weeks ago
AILab-CVC / GPT4Tools
GPT4Tools is an intelligent system that can automatically decide, control, and utilize different visual foundation models, allowing the u…
☆763Updated last year
shikras / shikra
☆756Updated 6 months ago