Un-*** 50 billions multimodality dataset
☆23Sep 14, 2022Updated 3 years ago
Alternatives and similar repositories for laion50BU
Users that are interested in laion50BU are comparing it to the libraries listed below
Sorting:
- Directed masked autoencoders☆14Feb 20, 2026Updated last week
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆16Apr 22, 2021Updated 4 years ago
- Simple large-scale training of stable diffusion with multi-node support.☆133May 8, 2023Updated 2 years ago
- ☆12Jun 14, 2021Updated 4 years ago
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆32Mar 21, 2023Updated 2 years ago
- ☆27Mar 13, 2021Updated 4 years ago
- A curated list of text-guided generative models resources☆158Nov 2, 2022Updated 3 years ago
- This project provides a data set with bounding boxes, body poses, 3D face meshes & captions of people from our LAION-2.2B. Additionally i…☆14Jan 2, 2022Updated 4 years ago
- Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.☆33Jul 21, 2023Updated 2 years ago
- My daily arxiv reading note☆30Nov 10, 2021Updated 4 years ago
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆28Aug 19, 2025Updated 6 months ago
- Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.☆60Mar 31, 2022Updated 3 years ago
- ☆18Jul 24, 2023Updated 2 years ago
- A tool for generating awesome AI art☆16Jul 29, 2022Updated 3 years ago
- Minimal JAX/Flax port of `lpips` supporting `vgg16`, with pre-trained weights stored in the 🤗 Hugging Face hub.☆17Aug 1, 2022Updated 3 years ago
- Simple notebooks to learn diffusion models on toy datasets☆17Feb 9, 2023Updated 3 years ago
- PyTorch code for MUST☆108May 1, 2025Updated 10 months ago
- Code release of paper "ForkMerge: Mitigating Negative Transfer in Auxiliary-Task Learning" (NeurIPS 2023)☆17Dec 30, 2023Updated 2 years ago
- clip retrieval benchmark☆17May 4, 2022Updated 3 years ago
- ☆16Mar 22, 2024Updated last year
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...☆320Dec 9, 2023Updated 2 years ago
- JAX implementation ViT-VQGAN☆82Sep 21, 2022Updated 3 years ago
- ☆19Jul 24, 2023Updated 2 years ago
- Code for reproducing the experiments on large-scale pre-training and transfer learning for the paper "Effect of large-scale pre-training …☆19May 29, 2022Updated 3 years ago
- combination of OpenAI GLIDE and Latent Diffusion☆136Apr 7, 2022Updated 3 years ago
- ☆36Oct 9, 2025Updated 4 months ago
- Learning Features with Parameter-Free Layers, ICLR 2022☆84May 3, 2023Updated 2 years ago
- A conda-smithy repository for python-spams.☆23Nov 6, 2024Updated last year
- This is code to accompany the paper "Accelerating Exploration with Unlabeled Prior Data".☆25Dec 5, 2023Updated 2 years ago
- GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts☆39Sep 30, 2025Updated 5 months ago
- Code relative to "Adversarial robustness against multiple and single $l_p$-threat models via quick fine-tuning of robust classifiers"☆19Nov 30, 2022Updated 3 years ago
- checkpoints for glide finetuned on laion and other datasets. wip.☆50Aug 17, 2022Updated 3 years ago
- Optimized library for large-scale extraction of frames and audio from video.☆201Sep 11, 2023Updated 2 years ago
- When Dall E was a baby trained on a bit of data☆27Feb 26, 2021Updated 5 years ago
- VQVAE | VAE | GumbelVAE | PixelCNN☆21Jun 15, 2020Updated 5 years ago
- GPU controlled Hetzner Cloud workers swarm for Crawling@Home project☆58Oct 9, 2022Updated 3 years ago
- Dalle service☆51Nov 27, 2021Updated 4 years ago
- DataComp: In search of the next generation of multimodal datasets☆772Apr 28, 2025Updated 10 months ago
- MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022☆615Dec 13, 2022Updated 3 years ago