kakaobrain / coyo-datasetLinks

COYO-700M: Large-scale Image-Text Pair Dataset

☆1,240

Alternatives and similar repositories for coyo-dataset

Users that are interested in coyo-dataset are comparing it to the libraries listed below

Sorting:

lucidrains / x-clip
A concise but complete implementation of CLIP with various experimental improvements from recent papers
☆716Updated 2 years ago
kakaobrain / karlo
☆699Updated 2 years ago
lucidrains / flamingo-pytorch
Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch
☆1,266Updated 3 years ago
lucidrains / muse-maskgit-pytorch
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch
☆912Updated last year
mlfoundations / datacomp
DataComp: In search of the next generation of multimodal datasets
☆742Updated 5 months ago
kakaobrain / mindall-e
PyTorch implementation of a 1.3B text-to-image generation model trained on 14 million image-text pairs
☆636Updated 3 years ago
lucidrains / parti-pytorch
Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch
☆541Updated last year
SHI-Labs / Versatile-Diffusion
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023
☆1,335Updated 2 years ago
kakaobrain / rq-vae-transformer
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)
☆956Updated last year
adobe-research / custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
☆1,970Updated last year
google-research / pix2seq
Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)
☆930Updated last year
microsoft / VQ-Diffusion
Official implementation of VQ-Diffusion
☆965Updated last year
kohjingyu / fromage
🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".
☆481Updated last year
microsoft / GenerativeImage2Text
GIT: A Generative Image-to-text Transformer for Vision and Language
☆574Updated last year
lucidrains / CoCa-pytorch
Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch
☆1,181Updated last year
m-bain / webvid
Large-scale text-video dataset. 10 million captioned short videos.
☆658Updated last year
huggingface / open-muse
Open reproduction of MUSE for fast text2image generation.
☆356Updated last year
google-research-datasets / wit
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique imag…
☆1,078Updated last year
facebookresearch / SLIP
Code release for SLIP Self-supervision meets Language-Image Pre-training
☆782Updated 2 years ago
google-research-datasets / conceptual-12m
Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.
☆403Updated 3 months ago
LAION-AI / CLIP_benchmark
CLIP-like model evaluation
☆776Updated 2 months ago
poloclub / diffusiondb
A large-scale text-to-image prompt gallery dataset based on Stable Diffusion
☆1,313Updated last year
FreddeFrallan / Multilingual-CLIP
OpenAI CLIP text encoders for multiple languages!
☆812Updated 2 years ago
facebookresearch / ToMe
A method to increase the speed and lower the memory footprint of existing vision transformers.
☆1,108Updated last year
google-research / magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
☆987Updated last year
allenai / mmc4
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
☆940Updated 6 months ago
Zasder3 / train-CLIP
A PyTorch Lightning solution to training OpenAI's CLIP from scratch.
☆713Updated 3 years ago
dome272 / Paella
Official Implementation of Paella https://arxiv.org/abs/2211.07292v2
☆747Updated 2 years ago
zai-org / CogView2
official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"
☆955Updated 3 years ago
kjsman / stable-diffusion-pytorch
Yet another PyTorch implementation of Stable Diffusion (probably easy to read)
☆591Updated last year