Un-*** 50 billions multimodality dataset
☆23Sep 14, 2022Updated 3 years ago
Alternatives and similar repositories for laion50BU
Users that are interested in laion50BU are comparing it to the libraries listed below.
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆32Mar 21, 2023Updated 3 years ago
- Script and models for clustering LAION-400m CLIP embeddings.☆26Jan 10, 2022Updated 4 years ago
- ☆65Oct 4, 2023Updated 2 years ago
- This project provides a data set with bounding boxes, body poses, 3D face meshes & captions of people from our LAION-2.2B. Additionally i…☆14Jan 2, 2022Updated 4 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & its caption derived f…☆16Apr 22, 2021Updated 4 years ago
- A curated list of text-guided generative models resources☆158Nov 2, 2022Updated 3 years ago
- Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.☆60Mar 31, 2022Updated 3 years ago
- Directed masked autoencoders☆14Updated this week
- ☆12Jun 14, 2021Updated 4 years ago
- ☆27Mar 13, 2021Updated 5 years ago
- Optimized library for large-scale extraction of frames and audio from video.☆201Sep 11, 2023Updated 2 years ago
- Xfce Desktop container designed for direct access to the GPU with EGL using VirtualGL for GPUs. Does not require /tmp/.X11-unix host sock…☆10Jul 25, 2022Updated 3 years ago
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...☆320Dec 9, 2023Updated 2 years ago
- NMS and Soft-NMS implemented in PyTorch and NumPy☆12Mar 22, 2021Updated 5 years ago
- ☆23Dec 16, 2022Updated 3 years ago
- Easily compute clip embeddings from video frames☆145Oct 31, 2023Updated 2 years ago
- GPU controlled Hetzner Cloud workers swarm for Crawling@Home project☆58Oct 9, 2022Updated 3 years ago
- Simple python template☆43Apr 25, 2024Updated last year
- PyTorch code for MUST☆108May 1, 2025Updated 10 months ago
- clip retrieval benchmark☆17May 4, 2022Updated 3 years ago
- DataComp: In search of the next generation of multimodal datasets☆771Apr 28, 2025Updated 10 months ago
- Code of "Visualizing and Understanding Object Detector"☆20Jun 24, 2021Updated 4 years ago
- ☆15Mar 6, 2025Updated last year
- ☆37Oct 21, 2022Updated 3 years ago
- ☆19Nov 4, 2025Updated 4 months ago
- JAX implementation ViT-VQGAN☆82Sep 21, 2022Updated 3 years ago
- [IJCV 2022] Information-Theoretic Odometry Learning☆16Apr 19, 2023Updated 2 years ago
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆29Aug 19, 2025Updated 7 months ago
- checkpoints for glide finetuned on laion and other datasets. wip.☆50Aug 17, 2022Updated 3 years ago
- Minimal JAX/Flax port of `lpips` supporting `vgg16`, with pre-trained weights stored in the 🤗 Hugging Face hub.☆17Aug 1, 2022Updated 3 years ago
- ☆18Jul 24, 2023Updated 2 years ago
- Simple notebooks to learn diffusion models on toy datasets☆17Feb 9, 2023Updated 3 years ago
- Official implementation of "DSP: Dual Soft-Paste for Unsupervised Domain Adaptive Semantic Segmentation"☆24Sep 8, 2021Updated 4 years ago
- CLOOB training (JAX) and inference (JAX and PyTorch)☆74May 16, 2022Updated 3 years ago
- MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022☆617Dec 13, 2022Updated 3 years ago
- COYO-700M: Large-scale Image-Text Pair Dataset☆1,251Nov 30, 2022Updated 3 years ago
- Training a model similar to OpenAI DALL-E with volunteers from all over the Internet using hivemind and dalle-pytorch (NeurIPS 2021 demo)☆27May 29, 2023Updated 2 years ago
- combination of OpenAI GLIDE and Latent Diffusion☆136Apr 7, 2022Updated 3 years ago
- A Benchmark for Efficient and Compositional Visual Reasoning☆25Aug 2, 2023Updated 2 years ago