allenai / mmc4Links

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

☆933

Alternatives and similar repositories for mmc4

Users that are interested in mmc4 are comparing it to the libraries listed below

Sorting:

kohjingyu / fromage
🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".
☆482Updated last year
luogen1996 / LaVIN
[NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"
☆521Updated last year
mlfoundations / datacomp
DataComp: In search of the next generation of multimodal datasets
☆724Updated 2 months ago
princeton-nlp / MeZO
[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333
☆1,116Updated last year
lupantech / ScienceQA
Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".
☆680Updated 10 months ago
lucidrains / flamingo-pytorch
Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch
☆1,249Updated 2 years ago
facebookresearch / MetaCLIP
ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Expert…
☆1,478Updated last week
AILab-CVC / GPT4Tools
GPT4Tools is an intelligent system that can automatically decide, control, and utilize different visual foundation models, allowing the u…
☆775Updated last year
salesforce / xgen
Salesforce open-source LLMs with 8k sequence length.
☆720Updated 5 months ago
eric-ai-lab / MiniGPT-5
Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"
☆861Updated 2 months ago
mlfoundations / MINT-1T
MINT-1T: A one trillion token multimodal interleaved dataset.
☆819Updated 11 months ago
abertsch72 / unlimiformer
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"
☆1,062Updated last year
microsoft / MM-REACT
Official repo for MM-REACT
☆953Updated last year
allenai / unified-io-2
☆616Updated last year
baaivision / Emu
Emu Series: Generative Multimodal Models from BAAI
☆1,736Updated 9 months ago
lucidrains / MEGABYTE-pytorch
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch
☆646Updated 6 months ago
OpenLMLab / LOMO
LOMO: LOw-Memory Optimization
☆988Updated last year
kyegomez / CM3Leon
An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal …
☆362Updated last year
ContextualAI / lens
This is the official repository for the LENS (Large Language Models Enhanced to See) system.
☆352Updated last year
LAION-AI / CLIP_benchmark
CLIP-like model evaluation
☆740Updated last month
mbzuai-nlp / LaMini-LM
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions
☆820Updated 2 years ago
InternLM / InternLM-techreport
☆905Updated 2 years ago
GanjinZero / RRHF
[NIPS2023] RRHF & Wombat
☆809Updated last year
LLaVA-VL / LLaVA-Plus-Codebase
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
☆751Updated last year
tatsu-lab / alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
☆818Updated last year
IBM / Dromedary
Dromedary: towards helpful, ethical and reliable LLMs.
☆1,148Updated 2 months ago
kohjingyu / gill
🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".
☆459Updated last year
sail-sg / lorahub
[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
☆640Updated last year
persimmon-ai-labs / adept-inference
Inference code for Persimmon-8B
☆415Updated last year
Shark-NLP / OpenICL
OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.
☆569Updated last year