yldcs / Unsupervised_Text-to-Image_Synthesis
Implementation of our PR 2020 paper:Unsupervised Text-to-Image Synthesis
☆13Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for Unsupervised_Text-to-Image_Synthesis
- RG-UNIT, ACM MM 2020.☆11Updated 3 years ago
- An pytorch implementation of our NeurIPS paper of PasteGAN: A Semi-Parametric Method to Generate Image from Scene Graph☆53Updated last year
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆17Updated 3 years ago
- The method of text-to-image☆48Updated 4 years ago
- PyTorch Implementation of "ZstGAN: An Adversarial Approach for Unsupervised Zero-Shot Image-to-Image Translation"☆41Updated 5 years ago
- [Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training☆21Updated 2 years ago
- A list of papers and other resources on language-guided image editing.☆37Updated 3 years ago
- ☆35Updated last year
- LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)☆18Updated last year
- LeicaGAN-Pytorch☆35Updated 4 years ago
- sairin1202 / Commonsense-Knowledge-Aware-Concept-Selection-For-Diverse-and-Informative-Visual-StorytellingThe implement of Commonsense Knowledge Aware Concept Selection For Diverse and Informative Visual Storytelling☆11Updated 3 years ago
- MLPs for Vision and Langauge Modeling (Coming Soon)☆27Updated 2 years ago
- ☆11Updated 4 years ago
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"☆18Updated 2 years ago
- ☆24Updated 3 years ago
- RareAct: A video dataset of unusual interactions☆32Updated 4 years ago
- We present a framework for training multi-modal deep learning models on unlabelled video data by forcing the network to learn invariances…☆45Updated 3 years ago
- Code of our Neurips2020 paper "Auto Learning Attention", coming soon☆21Updated 3 years ago
- The code for paper "Contrastive Spatio-Temporal Pretext Learning for Self-supervised Video Representation" which is accepted by AAAI 2022☆10Updated 2 years ago
- One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations. NeurIPS2022.☆34Updated last year
- ☆13Updated 3 years ago
- support Large Vocabulary Instance Segmentation (LVIS) dataset for mmdetection☆16Updated 4 years ago
- ☆74Updated 2 years ago
- MDMMT: Multidomain Multimodal Transformer for Video Retrieval☆26Updated 3 years ago
- Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)☆17Updated last year
- When can you tell whether an image has been cropped or not?☆29Updated 3 years ago
- ☆51Updated 4 years ago
- Multi-sense word embeddings from visual co-occurrences☆25Updated 5 years ago
- Code for "A Graph-Based Framework to Bridge Movies and Synopses", ICCV2019☆51Updated 4 years ago
- Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L…☆37Updated 2 years ago