christophschuhmann / 4MC-4M-Image-Text-Pairs-with-CLIP-embeddings
I have created a dataset of image-text pairs, derived from YFCC100M, by using the cosine similarity between the CLIP embeddings of each image and its caption. I have also added probabilities from an NSFW detector and more.
☆15 Updated 3 years ago
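The description above mentions selecting pairs by the cosine similarity of CLIP embeddings. A minimal sketch of that idea follows, assuming the image and caption embeddings have already been computed and stored as NumPy arrays. The function names `cosine_similarity` and `filter_pairs` and the `threshold=0.3` default are hypothetical choices for illustration; the repository's actual cutoff and pipeline are not stated here.

```python
import numpy as np

def cosine_similarity(image_emb, text_emb):
    """Row-wise cosine similarity between two (n, d) arrays of embeddings."""
    image_emb = np.asarray(image_emb, dtype=np.float64)
    text_emb = np.asarray(text_emb, dtype=np.float64)
    # Normalize each row to unit length so the dot product equals the cosine.
    image_unit = image_emb / np.linalg.norm(image_emb, axis=1, keepdims=True)
    text_unit = text_emb / np.linalg.norm(text_emb, axis=1, keepdims=True)
    return np.sum(image_unit * text_unit, axis=1)

def filter_pairs(image_emb, text_emb, threshold=0.3):
    """Return indices of pairs whose similarity is at least `threshold`.

    The threshold is an assumed value for illustration only.
    """
    sims = cosine_similarity(image_emb, text_emb)
    keep = np.flatnonzero(sims >= threshold)
    return keep, sims

# Toy 2-dimensional "embeddings" standing in for real CLIP vectors.
images = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
texts = [[1.0, 0.0], [1.0, 0.0], [1.0, 0.0]]
keep, sims = filter_pairs(images, texts, threshold=0.3)
```

With the toy vectors, the second pair is orthogonal and is dropped, while the first and third are kept.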
Related projects
Alternatives and complementary repositories for 4MC-4M-Image-Text-Pairs-with-CLIP-embeddings
- Script and models for clustering LAION-400m CLIP embeddings. ☆25 Updated 2 years ago
- ☆21 Updated 3 years ago
- Describe the format of image/text datasets ☆11 Updated 2 years ago
- ☆0 Updated last year
- A CLIP conditioned Decision Transformer. ☆22 Updated 3 years ago
- OpenAI CLIP based image generator with complex config file controlled transformation and training pipelines ☆18 Updated 2 years ago
- ☆30 Updated 2 years ago
- Un-*** 50 billions multimodality dataset ☆24 Updated 2 years ago
- Official repository for the paper "Images as Weight Matrices: Sequential Image Generation Through Synaptic Learning Rules" (ICLR 2023) ☆12 Updated last year
- Contrastive Language-Audio Pretraining ☆15 Updated 3 years ago
- codebase for the SIMAT dataset and evaluation ☆38 Updated 2 years ago
- Generate images from texts. In Russian ☆19 Updated 2 years ago
- Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP ☆39 Updated last year
- Inverts CLIP text embeds to image embeds and visualizes with deep-image-prior. ☆35 Updated 2 years ago
- Implementation of Metaformer, but in an autoregressive manner ☆23 Updated 2 years ago
- Guide diffusion on ImageBind embedding similarity ☆27 Updated last year
- ☆20 Updated 8 months ago
- Hidden Engrams: Long Term Memory for Transformer Model Inference ☆34 Updated 3 years ago
- CHARacter-awaRE Diffusion: Multilingual Character-Aware Encoders for Font-Aware Diffusers That Can Actually Spell ☆14 Updated last year
- Colab notebook to finetune GLIDE. ☆12 Updated 2 years ago
- ☆14 Updated 3 years ago
- Kaggle fashion dataset in dalle format ☆13 Updated 3 years ago
- Training simple models to predict CLIP image embeddings from text embeddings, and vice versa. ☆59 Updated 2 years ago
- Implementation of a holodeck, written in Pytorch ☆17 Updated last year
- When Dall E was a baby trained on a bit of data ☆25 Updated 3 years ago
- Aggregating embeddings over time ☆31 Updated last year
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart) ☆25 Updated 2 years ago
- GET3D online data renderer ☆11 Updated last year
- Implementation of Analyzing and Improving the Image Quality of StyleGAN (StyleGAN 2) in PyTorch ☆10 Updated 3 years ago