MIMICLab / L-VerseLinks
L-Verse: Bidirectional Generation Between Image and Text
☆107Updated 8 months ago
Alternatives and similar repositories for L-Verse
Users that are interested in L-Verse are comparing it to the libraries listed below
Sorting:
- ☆96Updated last week
- ☆47Updated last year
- DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)☆143Updated 6 months ago
- Code for paper LAFITE: Towards Language-Free Training for Text-to-Image Generation (CVPR 2022)☆183Updated 2 years ago
- ☆53Updated 3 years ago
- PyTorch code for MUST☆107Updated 7 months ago
- ☆34Updated 2 years ago
- Official code repository for the EMNLP 2021 paper☆26Updated 3 years ago
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT☆56Updated last year
- Extended COCO Validation (ECCV) Caption dataset (ECCV 2022)☆56Updated last year
- Implementation of Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic☆280Updated 3 years ago
- Large-Scale Bidirectional Training for Zero-Shot Image Captioning☆21Updated 2 years ago
- Using pretrained encoder and language models to generate captions from multimedia inputs.☆98Updated 2 years ago
- ☆48Updated 4 years ago
- ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis☆125Updated 3 years ago
- ECCV2020 paper: Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code and Data.☆86Updated 2 years ago
- ☆120Updated 2 years ago
- ☆45Updated 3 years ago
- CLIP-Art: Contrastive Pre-training for Fine-Grained Art Classification - 4th Workshop on Computer Vision for Fashion, Art, and Design☆28Updated 3 years ago
- Release of ImageNet-Captions☆51Updated 2 years ago
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch☆65Updated 3 years ago
- Implementation of the video diffusion model and training scheme presented in the paper, Flexible Diffusion Modeling of Long Videos, in Py…☆85Updated 3 years ago
- Official code of "StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis" (CVPR 2022)☆43Updated 3 years ago
- This is an official implementation of GRIT-VLP☆20Updated 3 years ago
- ☆30Updated 2 years ago
- Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, de…☆102Updated 3 years ago
- ☆55Updated 2 years ago
- ☆26Updated 4 years ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆80Updated 3 years ago
- [ACM MM 2022] Towards Counterfactual Image Manipulation via CLIP☆38Updated 3 years ago