MIMICLab / L-VerseLinks
L-Verse: Bidirectional Generation Between Image and Text
☆107Updated 9 months ago
Alternatives and similar repositories for L-Verse
Users that are interested in L-Verse are comparing it to the libraries listed below
Sorting:
- ☆96Updated 3 weeks ago
- ☆47Updated last year
- DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)☆143Updated 7 months ago
- ☆54Updated 3 years ago
- Code for paper LAFITE: Towards Language-Free Training for Text-to-Image Generation (CVPR 2022)☆183Updated 2 years ago
- ☆34Updated 2 years ago
- Official code repository for the EMNLP 2021 paper☆26Updated 4 years ago
- Extended COCO Validation (ECCV) Caption dataset (ECCV 2022)☆56Updated last year
- Implementation of Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic☆278Updated 3 years ago
- Large-Scale Bidirectional Training for Zero-Shot Image Captioning☆21Updated 2 years ago
- ☆30Updated 3 years ago
- ECCV2020 paper: Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code and Data.☆86Updated 2 years ago
- ☆48Updated 4 years ago
- ☆45Updated 4 years ago
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT☆56Updated last year
- PyTorch code for MUST☆108Updated 8 months ago
- ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis☆124Updated 3 years ago
- Official Pytorch implementation of GGDR (ECCV 2022)☆102Updated 3 years ago
- Simple script to compute CLIP-based scores given a DALL-e trained model.☆29Updated 4 years ago
- Using pretrained encoder and language models to generate captions from multimedia inputs.☆100Updated 2 years ago
- [ICLR-2023] Rarity Score : A New Metric to Evaluate the Uncommonness of Synthesized Images☆66Updated 3 years ago
- Official Implementation of CoSMo: Content-Style Modulation for Image Retrieval with Text Feedback presented in CVPR 2021.☆66Updated 3 years ago
- ☆26Updated 4 years ago
- ☆122Updated 2 years ago
- Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, de…☆102Updated 3 years ago
- A unified framework to jointly model images, text, and human attention traces.☆79Updated 4 years ago
- [ACM MM 2022] Towards Counterfactual Image Manipulation via CLIP☆38Updated 3 years ago
- This is an official implementation of GRIT-VLP☆20Updated 3 years ago
- Release of ImageNet-Captions☆51Updated 3 years ago
- pytorch implementation of XMC-GAN☆11Updated 4 years ago