lyuchenyang / Macaw-LLMLinks

Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration

☆1,579

Alternatives and similar repositories for Macaw-LLM

Users that are interested in Macaw-LLM are comparing it to the libraries listed below

Sorting:

EvolvingLMMs-Lab / Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing imp…
☆3,272Updated last year
dvlab-research / LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
☆2,685Updated last year
yxuansu / PandaGPT
[TLLM'23] PandaGPT: One Model To Instruction-Follow Them All
☆822Updated 2 years ago
Alpha-VLLM / LLaMA2-Accessory
An Open-source Toolkit for LLM Development
☆2,789Updated 9 months ago
AILab-CVC / GPT4Tools
GPT4Tools is an intelligent system that can automatically decide, control, and utilize different visual foundation models, allowing the u…
☆772Updated last year
eric-ai-lab / MiniGPT-5
Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"
☆861Updated 5 months ago
OpenGVLab / Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
☆3,305Updated 8 months ago
0nutation / SpeechGPT
SpeechGPT Series: Speech Large Language Models
☆1,386Updated last year
microsoft / i-Code
☆1,707Updated last year
DAMO-NLP-SG / Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
☆3,079Updated last year
open-mmlab / Multimodal-GPT
Multimodal-GPT
☆1,507Updated 2 years ago
baaivision / Emu
Emu Series: Generative Multimodal Models from BAAI
☆1,742Updated last year
NExT-GPT / NExT-GPT
Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model
☆3,568Updated 5 months ago
CStanKonrad / long_llama
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transform…
☆1,460Updated last year
X-PLUG / mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
☆2,522Updated 6 months ago
OpenBMB / VisCPM
[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列
☆1,068Updated last year
magic-research / bubogpt
BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs
☆510Updated 2 years ago
FreedomIntelligence / LLMZoo
⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡
☆2,940Updated last year
OpenBMB / BMTools
Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
☆2,788Updated last year
microsoft / MM-REACT
Official repo for MM-REACT
☆957Updated last year
luogen1996 / LaVIN
[NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"
☆523Updated last year
OpenMOSS / AnyGPT
Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"
☆861Updated last year
OpenBMB / ToolBench
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
☆5,276Updated 4 months ago
Victorwz / LongMem
Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory".
☆809Updated last year
LLaVA-VL / LLaVA-Plus-Codebase
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
☆758Updated last year
showlab / VLog
[CVPR 2025] Video Narration as Vocabulary & Video as Long Document
☆578Updated 7 months ago
PKU-YuanGroup / MoE-LLaVA
【TMM 2025🔥】 Mixture-of-Experts for Large Vision-Language Models
☆2,250Updated 3 months ago
PKU-YuanGroup / LanguageBind
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
☆831Updated last year
BAAI-DCAI / Bunny
A family of lightweight multimodal models.
☆1,044Updated 10 months ago
VITA-MLLM / Woodpecker
✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models
☆639Updated 9 months ago