UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning
☆70May 20, 2021Updated 4 years ago
Alternatives and similar repositories for UNIMO
Users that are interested in UNIMO are comparing it to the libraries listed below
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Sep 17, 2021Updated 4 years ago
- [ACL 2021] mTVR: Multilingual Video Moment Retrieval☆27Aug 20, 2022Updated 3 years ago
- Learning Cross-modal Retrieval with Noisy Labels (CVPR 2021, PyTorch Code)☆13Apr 7, 2021Updated 4 years ago
- Code for ACL 2018 paper "Discourse Marker Augmented Network with Reinforcement Learning for Natural Language Inference".☆17Aug 5, 2018Updated 7 years ago
- Simple experiments with the P-tuning method on Chinese☆140Apr 8, 2021Updated 4 years ago
- Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"☆800Jun 30, 2021Updated 4 years ago
- ☆17May 8, 2014Updated 11 years ago
- Grid features pre-training code for visual question answering☆269Sep 17, 2021Updated 4 years ago
- Pytorch version of VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer (NeurIPS 2021)☆56Feb 6, 2023Updated 3 years ago
- ☆14May 3, 2022Updated 3 years ago
- Cross-modal Coherence Modeling for Caption Generation☆11Jul 24, 2020Updated 5 years ago
- A pytorch implementation of the ICCV2021 workshop paper SimDis: Simple Distillation Baselines for Improving Small Self-supervised Models☆14Jul 15, 2021Updated 4 years ago
- [AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Text Matching”☆219Apr 11, 2024Updated last year
- Adversarial Training for Natural Language Understanding☆253Sep 6, 2023Updated 2 years ago
- Code Release for "On the Inductive Bias of Masked Language Modeling: From Statistical to Syntactic Dependencies"☆16Apr 13, 2021Updated 4 years ago
- Source code and checkpoints for legal pre-trained language models.☆15May 9, 2021Updated 4 years ago
- Source code of our TMM 2019 paper "Multi-pathway Generative Adversarial Hashing for Unsupervised Cross-modal Retrieval"☆12Jun 17, 2019Updated 6 years ago
- ☆14May 10, 2021Updated 4 years ago
- Unofficial implementation of Face0 with SDXL☆12Sep 1, 2023Updated 2 years ago
- ☆478Nov 21, 2022Updated 3 years ago
- Recent Advances in Vision and Language PreTrained Models (VL-PTMs)☆1,155Aug 19, 2022Updated 3 years ago
- ☆218Feb 26, 2022Updated 4 years ago
- Implementation of our CVPR2020 paper, Graph Structured Network for Image-Text Matching☆170Oct 12, 2020Updated 5 years ago
- Official Github repo of the VIST Challenge NAACL 2018☆17Aug 3, 2018Updated 7 years ago
- FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax☆18Nov 23, 2023Updated 2 years ago
- PyTorch code for EMNLP 2020 Paper "Vokenization: Improving Language Understanding with Visual Supervision"☆192Mar 8, 2021Updated 4 years ago
- A dataset of crowdsourced ratings for machine-generated image captions☆38Sep 20, 2019Updated 6 years ago
- Use CLIP to represent video for Retrieval Task☆70Mar 1, 2021Updated 5 years ago
- The official code for the paper "Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking", ACM Multimedia 2019 Oral☆68Sep 28, 2019Updated 6 years ago
- Tensorflow Reproduction of the EMNLP-2018 paper "Temporally Grounding Natural Sentence in Video"☆17Nov 21, 2022Updated 3 years ago
- Search photos on Unsplash based on OpenAI's CLIP model, support search with joint image+text queries and attention visualization.☆224Sep 9, 2021Updated 4 years ago
- Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)☆17Jan 12, 2023Updated 3 years ago
- Code for the paper "Region-based Non-local operation for Video Classification"☆18Nov 28, 2021Updated 4 years ago
- Video Contrastive Learning with Global Context, ICCVW 2021☆162May 30, 2022Updated 3 years ago
- Adaptive Offline Quintuplet Loss for Image-Text Matching (AOQ)☆34Jul 2, 2020Updated 5 years ago
- This is the implementation of self-CIDEr and LSA-based diversity metrics (only for python 2.7).☆36Feb 26, 2022Updated 4 years ago
- Code for the HowTo100M paper☆293Mar 10, 2020Updated 5 years ago
- PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".☆966Oct 22, 2022Updated 3 years ago
- Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021 (Oral)☆164Aug 24, 2025Updated 6 months ago