MIMICLab / L-VerseLinks
L-Verse: Bidirectional Generation Between Image and Text
☆109Updated 6 months ago
Alternatives and similar repositories for L-Verse
Users that are interested in L-Verse are comparing it to the libraries listed below
Sorting:
- ☆97Updated 2 months ago
- ☆46Updated last year
- DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)☆142Updated 4 months ago
- Code for paper LAFITE: Towards Language-Free Training for Text-to-Image Generation (CVPR 2022)☆183Updated 2 years ago
- ☆53Updated 3 years ago
- Official code repository for the EMNLP 2021 paper☆26Updated 3 years ago
- Using pretrained encoder and language models to generate captions from multimedia inputs.☆97Updated 2 years ago
- ECCV2020 paper: Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code and Data.☆85Updated 2 years ago
- Large-Scale Bidirectional Training for Zero-Shot Image Captioning☆21Updated 2 years ago
- PyTorch code for MUST☆107Updated 5 months ago
- Extended COCO Validation (ECCV) Caption dataset (ECCV 2022)☆56Updated last year
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT☆55Updated last year
- Implementation of Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic☆279Updated 3 years ago
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch☆65Updated 3 years ago
- ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis☆125Updated 3 years ago
- ☆30Updated 2 years ago
- ☆45Updated 3 years ago
- ☆120Updated 2 years ago
- ☆26Updated 4 years ago
- A non-JIT version implementation / replication of CLIP of OpenAI in pytorch☆34Updated 4 years ago
- Simple script to compute CLIP-based scores given a DALL-e trained model.☆30Updated 4 years ago
- Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, de…☆102Updated 3 years ago
- [ACM MM 2022] Towards Counterfactual Image Manipulation via CLIP☆38Updated 3 years ago
- This is an official implementation of GRIT-VLP☆21Updated 3 years ago
- Official code of "StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis" (CVPR 2022)☆43Updated 3 years ago
- ☆47Updated 4 years ago
- Refactoring dalle-pytorch and taming-transformers for TPU VM☆60Updated 4 years ago
- Official Implementation of CoSMo: Content-Style Modulation for Image Retrieval with Text Feedback presented in CVPR 2021.☆66Updated 3 years ago
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)☆33Updated 2 years ago
- CLIP-Art: Contrastive Pre-training for Fine-Grained Art Classification - 4th Workshop on Computer Vision for Fashion, Art, and Design☆28Updated 3 years ago