Code for reproducing some of the diagrams in the paper "Multimodal Neurons in Artificial Neural Networks"
☆309 · Mar 21, 2021 · Updated 4 years ago
Alternatives and similar repositories for CLIP-featurevis
Users who are interested in CLIP-featurevis are comparing it to the libraries listed below.
- Code associated with our paper "Learning Group Structure and Disentangled Representations of Dynamical Environments" ☆15 · Dec 8, 2022 · Updated 3 years ago
- ☆2,083 · Apr 29, 2022 · Updated 3 years ago
- [CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations ☆565 · Aug 22, 2025 · Updated 6 months ago
- PyTorch package for the discrete VAE used for DALL·E. ☆10,873 · Jan 31, 2024 · Updated 2 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c… ☆359 · Feb 22, 2022 · Updated 4 years ago
- ☆221 · Jun 8, 2020 · Updated 5 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis. ☆147 · Jul 26, 2021 · Updated 4 years ago
- Code for EMNLP2021 paper "Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training" ☆20 · Nov 12, 2021 · Updated 4 years ago
- [CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning… ☆729 · Aug 8, 2023 · Updated 2 years ago
- Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021) ☆102 · Nov 2, 2020 · Updated 5 years ago
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images. ☆3,295 · Mar 3, 2024 · Updated 2 years ago
- Chef cookbooks for managing a Ceph cluster ☆11 · Apr 2, 2023 · Updated 2 years ago
- Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning ☆12 · Aug 23, 2025 · Updated 6 months ago
- Retryable HTTP client in Go ☆13 · Apr 2, 2023 · Updated 2 years ago
- Repository for "Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search" ☆179 · Sep 30, 2021 · Updated 4 years ago
- Flexible Feature visualization on PyTorch, for research and art ☆245 · Mar 17, 2025 · Updated 11 months ago
- Fluentd output plugin that sends events to Amazon Kinesis Streams and Amazon Kinesis Firehose. ☆12 · Apr 2, 2023 · Updated 2 years ago
- WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique imag… ☆1,100 · Sep 27, 2024 · Updated last year
- Experiments with Neural ODEs and Adversarial Attacks ☆44 · Jan 13, 2019 · Updated 7 years ago
- Code release for SLIP Self-supervision meets Language-Image Pre-training ☆787 · Feb 9, 2023 · Updated 3 years ago
- Oscar and VinVL ☆1,053 · Aug 28, 2023 · Updated 2 years ago
- PyTorch code for EMNLP 2020 Paper "Vokenization: Improving Language Understanding with Visual Supervision" ☆192 · Mar 8, 2021 · Updated 4 years ago
- Repository for the paper "Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images" ☆451 · Apr 28, 2023 · Updated 2 years ago
- ☆124 · Jun 16, 2022 · Updated 3 years ago
- MoPro: Webly Supervised Learning ☆88 · May 1, 2025 · Updated 10 months ago
- A tutorial example for nbdev ☆15 · Feb 26, 2022 · Updated 4 years ago
- Implementations of GANs in Tensorflow 2.x ☆15 · Feb 12, 2022 · Updated 4 years ago
- [ICLR 2022] code for "How Much Can CLIP Benefit Vision-and-Language Tasks?" https://arxiv.org/abs/2107.06383 ☆421 · Oct 28, 2022 · Updated 3 years ago
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image ☆32,642 · Feb 18, 2026 · Updated 2 weeks ago
- ☆28 · Jul 21, 2021 · Updated 4 years ago
- Open-AI's DALL-E for large scale training in mesh-tensorflow. ☆431 · Feb 12, 2022 · Updated 4 years ago
- GLIDE: a diffusion-based text-conditional image synthesis model ☆3,686 · Mar 8, 2024 · Updated last year
- Generate a denotation graph from a set of image captions ☆15 · Sep 4, 2018 · Updated 7 years ago
- Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training. ☆415 · Jul 14, 2025 · Updated 7 months ago
- Generative Adversarial Transformers ☆1,345 · Jun 14, 2022 · Updated 3 years ago
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers" ☆1,610 · Aug 12, 2020 · Updated 5 years ago
- Lifelong Variational Autoencoder ☆15 · Dec 6, 2017 · Updated 8 years ago
- Code for the paper "Understanding RL Vision" ☆50 · Apr 2, 2023 · Updated 2 years ago
- Code for the NeurIPS 2019 paper: "Learning Dynamics of Attention: Human Prior for Interpretable Machine Reasoning" ☆33 · Jun 27, 2023 · Updated 2 years ago