klory / CookGAN
This is the official repository for CookGAN: Meal Image Synthesis from Ingredients
☆23 · Updated 2 years ago
Alternatives and similar repositories for CookGAN:
Users who are interested in CookGAN are comparing it to the repositories listed below.
- A non-JIT version implementation / replication of CLIP of OpenAI in pytorch ☆34 · Updated 4 years ago
- PyTorch implementation of Contrastive Feature Loss for Image Prediction (AIM Workshop at ICCV 2021) ☆53 · Updated 3 years ago
- RG-UNIT, ACM MM 2020. ☆10 · Updated 3 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language". ☆18 · Updated 3 years ago
- [FGVC9-CVPR 2022] The second place solution for 2nd eBay eProduct Visual Search Challenge. ☆26 · Updated 2 years ago
- This project provides a data set with bounding boxes, body poses, 3D face meshes & captions of people from our LAION-2.2B. Additionally i… ☆14 · Updated 3 years ago
- CLIP-Art: Contrastive Pre-training for Fine-Grained Art Classification - 4th Workshop on Computer Vision for Fashion, Art, and Design ☆27 · Updated 2 years ago
- Implementation of Taming Transformers for High-Resolution Image Synthesis (https://arxiv.org/abs/2012.09841) in PyTorch ☆16 · Updated 4 years ago
- [ACM MM 2022] Towards Counterfactual Image Manipulation via CLIP ☆37 · Updated 2 years ago
- Large-Scale Bidirectional Training for Zero-Shot Image Captioning ☆21 · Updated 2 years ago
- L-Verse: Bidirectional Generation Between Image and Text ☆108 · Updated 3 weeks ago
- Implementation of Multistream Transformers in Pytorch ☆53 · Updated 3 years ago
- Un-*** 50 billions multimodality dataset ☆24 · Updated 2 years ago
- Pytorch implementation of StyleGAN2 in my style ☆11 · Updated 2 years ago
- codebase for the SIMAT dataset and evaluation ☆39 · Updated 3 years ago
- Evaluation benchmark for the task of Semantic Image Translation. Contains code to run FlexIT (CVPR 2022) ☆35 · Updated 3 years ago
- Aggregating embeddings over time ☆31 · Updated 2 years ago
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023) ☆33 · Updated 2 years ago
- Implementation of our PR 2020 paper: Unsupervised Text-to-Image Synthesis ☆13 · Updated 4 years ago
- Script and models for clustering LAION-400m CLIP embeddings. ☆26 · Updated 3 years ago
- This is an official implementation of GRIT-VLP ☆21 · Updated 2 years ago
- SuperStyleNet: Deep Image Synthesis with Superpixel Based Style Encoder (BMVC 2021) ☆27 · Updated 3 years ago
- Inverts CLIP text embeds to image embeds and visualizes with deep-image-prior. ☆35 · Updated 2 years ago
- Repository for LatentCLR: A Contrastive Learning Approach for Unsupervised Discovery of Interpretable Directions, ICCV 2021 ☆55 · Updated 3 years ago