weiyx16 / CLIP-pytorch
A non-JIT version implementation / replication of CLIP of OpenAI in pytorch
☆34Updated 4 years ago
Alternatives and similar repositories for CLIP-pytorch:
Users that are interested in CLIP-pytorch are comparing it to the libraries listed below
- An open source implementation of CLIP.☆32Updated 2 years ago
- PyTorch implementation of Contrastive Feature Loss for Image Prediction (AIM Workshop at ICCV 2021)☆53Updated 3 years ago
- ICCV2021 (poster)☆74Updated 3 years ago
- ☆17Updated 2 years ago
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆59Updated 4 years ago
- Single Image Texture Translation for Data Augmentation☆61Updated 2 years ago
- Refactoring dalle-pytorch and taming-transformers for TPU VM☆60Updated 3 years ago
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images☆58Updated 3 years ago
- ☆98Updated 5 months ago
- Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, de…☆99Updated 2 years ago
- A unified framework to jointly model images, text, and human attention traces.☆78Updated 3 years ago
- TensorFlow implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"☆33Updated 3 years ago
- L-Verse: Bidirectional Generation Between Image and Text☆108Updated last week
- Un-*** 50 billions multimodality dataset☆24Updated 2 years ago
- Evaluation benchmark for the task of Semantic Image Translation. Contains code to run FlexIT (CVPR 2022)☆35Updated 3 years ago
- ☆26Updated 3 years ago
- clip retrieval benchmark☆17Updated 2 years ago
- A task-agnostic vision-language architecture as a step towards General Purpose Vision☆92Updated 3 years ago
- Invert and perturb GAN images for test-time ensembling☆96Updated 3 years ago
- ☆21Updated 4 years ago
- ☆57Updated 4 years ago
- ☆28Updated 3 years ago
- Implementation of OmniNet, Omnidirectional Representations from Transformers, in Pytorch☆57Updated 4 years ago
- [NeurIPS'21] "Ultra-Data-Efficient GAN Training: Drawing A Lottery Ticket First, Then Training It Toughly", Tianlong Chen, Yu Cheng, Zhe …☆84Updated 3 years ago
- Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch☆118Updated 3 years ago
- Audio-conditioned video texture generation☆24Updated 2 years ago
- Release of ImageNet-Captions☆47Updated 2 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Updated 3 years ago
- Implementation of the Remixer Block from the Remixer paper, in Pytorch☆35Updated 3 years ago
- Command-line tool for downloading and extending the RedCaps dataset.☆46Updated last year