weiyx16 / CLIP-pytorch
A non-JIT implementation / replication of OpenAI's CLIP in PyTorch
☆34 Updated 4 years ago
Alternatives and similar repositories for CLIP-pytorch
Users who are interested in CLIP-pytorch compare it to the libraries listed below.
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch ☆59 Updated 4 years ago
- PyTorch implementation of Contrastive Feature Loss for Image Prediction (AIM Workshop at ICCV 2021) ☆54 Updated 3 years ago
- ICCV2021 (poster) ☆74 Updated 3 years ago
- ☆17 Updated 2 years ago
- Refactoring dalle-pytorch and taming-transformers for TPU VM ☆60 Updated 4 years ago
- Implementation of OmniNet, Omnidirectional Representations from Transformers, in Pytorch ☆59 Updated 4 years ago
- Official repository for MaGNET, ICLR 2022 ☆24 Updated 3 years ago
- Script and models for clustering LAION-400m CLIP embeddings. ☆26 Updated 3 years ago
- Audio-conditioned video texture generation ☆24 Updated 3 years ago
- ☆97 Updated 2 months ago
- Official code repository for Instance Selection for GANs. ☆44 Updated 4 years ago
- An open source implementation of CLIP. ☆33 Updated 2 years ago
- [NeurIPS'21] "Ultra-Data-Efficient GAN Training: Drawing A Lottery Ticket First, Then Training It Toughly", Tianlong Chen, Yu Cheng, Zhe … ☆84 Updated 3 years ago
- ☆36 Updated 3 years ago
- This is the official repository for CookGAN: Meal Image Synthesis from Ingredients ☆23 Updated 2 years ago
- Implementation of our PR 2020 paper: Unsupervised Text-to-Image Synthesis ☆13 Updated 5 years ago
- Linear image-to-image translation ☆41 Updated 5 years ago
- CLIP-Art: Contrastive Pre-training for Fine-Grained Art Classification - 4th Workshop on Computer Vision for Fashion, Art, and Design ☆28 Updated 3 years ago
- Implementation of Taming Transformers for High-Resolution Image Synthesis (https://arxiv.org/abs/2012.09841) in PyTorch ☆16 Updated 4 years ago
- ☆63 Updated 3 years ago
- JAX implementation of ViT-VQGAN ☆82 Updated 3 years ago
- Using pretrained encoder and language models to generate captions from multimedia inputs. ☆97 Updated 2 years ago
- L-Verse: Bidirectional Generation Between Image and Text ☆109 Updated 6 months ago
- Un-*** 50 billions multimodality dataset ☆22 Updated 3 years ago
- SuperStyleNet: Deep Image Synthesis with Superpixel Based Style Encoder (BMVC 2021) ☆27 Updated 3 years ago
- [ACM MM 2020] DWC-GAN. ☆33 Updated 4 years ago
- Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, de… ☆102 Updated 3 years ago
- Codebase for the SIMAT dataset and evaluation ☆38 Updated 3 years ago
- [ICLR'23] Code to reproduce the results in the paper "PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs" ☆58 Updated 2 years ago
- ☆28 Updated 3 years ago