openai / CLIP-featurevisLinks
code for reproducing some of the diagrams in the paper "Multimodal Neurons in Artificial Neural Networks"
☆310Updated 4 years ago
Alternatives and similar repositories for CLIP-featurevis
Users that are interested in CLIP-featurevis are comparing it to the libraries listed below
Sorting:
- ☆235Updated 2 years ago
- Repository for "Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search"☆179Updated 4 years ago
- Code for making music videos using CLIP☆174Updated 4 years ago
- Official codebase for Pretrained Transformers as Universal Computation Engines.☆247Updated 3 years ago
- An attempt at the implementation of Glom, Geoffrey Hinton's new idea that integrates concepts from neural fields, top-down-bottom-up proc…☆194Updated 4 years ago
- This is a summary of easily available datasets for generalized DALLE-pytorch training.☆128Updated 3 years ago
- JAX implementation of VQGAN☆91Updated 3 years ago
- ☆356Updated 3 years ago
- Contrastive Language-Image Pretraining☆143Updated 3 years ago
- Learning to ground explanations of affect for visual art.☆317Updated 4 years ago
- Open-AI's DALL-E for large scale training in mesh-tensorflow.☆433Updated 3 years ago
- Here is a collection of checkpoints for DALLE-pytorch models, from where you can keep on training or start generating images.☆146Updated 2 years ago
- Code for the paper, "Distribution Augmentation for Generative Modeling", ICML 2020.☆129Updated 2 years ago
- ☆87Updated 3 years ago
- MERLOT: Multimodal Neural Script Knowledge Models☆223Updated 3 years ago
- Code to train and evaluate the GeNeVA-GAN model for the GeNeVA task proposed in our ICCV 2019 paper "Tell, Draw, and Repeat: Generating a…☆85Updated 3 years ago
- PyTorch Implementation of OpenAI's Image GPT☆260Updated 2 years ago
- ☆63Updated 3 years ago
- [CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations☆564Updated 2 months ago
- ☆150Updated 2 years ago
- PyTorch implementation of ICLR 2020 paper "CLEVRER: CoLlision Events for Video REpresentation and Reasoning"☆123Updated 4 years ago
- Trains Transformer model variants. Data isn't shuffled between batches.☆143Updated 3 years ago
- ☆160Updated 3 years ago
- Network-to-Network Translation with Conditional Invertible Neural Networks☆226Updated 2 years ago
- ☆196Updated 3 years ago
- Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.☆405Updated 3 months ago
- Refactoring dalle-pytorch and taming-transformers for TPU VM☆60Updated 4 years ago
- Aim for the moon. If you miss, you may hit a star.☆163Updated 2 years ago
- Python Research Framework☆106Updated 2 years ago
- Modelverse: Content-Based Search for Deep Generative Models☆224Updated 10 months ago