black4321 / InterBERT
The official implementation of InterBERT
☆11Updated 2 years ago
Alternatives and similar repositories for InterBERT:
Users that are interested in InterBERT are comparing it to the libraries listed below
- MLPs for Vision and Langauge Modeling (Coming Soon)☆27Updated 3 years ago
- ☆20Updated 3 years ago
- [EMNLP 2021] Code and data for our paper "Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers…☆20Updated 3 years ago
- source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT☆72Updated 2 years ago
- [ECCV2022] Contrastive Vision-Language Pre-training with Limited Resources☆44Updated 2 years ago
- ☆13Updated 5 years ago
- This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron …☆32Updated last year
- 👾 A library of state-of-the-art pretrained models for Natural Language Processing (NLP)☆9Updated 5 years ago
- For visual commonsense model☆34Updated 6 years ago
- ☆11Updated 4 years ago
- [AAAI 2021] Confidence-aware Non-repetitive Multimodal Transformers for TextCaps☆24Updated 2 years ago
- A PyTorch implementation of our proposed loss function from the paper "SimLoss: Class Similarities in Cross Entropy"☆25Updated 3 years ago
- ☆57Updated 3 years ago
- The project is about predicting sets (of classes) from images.☆22Updated 3 years ago
- Research code for "Training Vision-Language Transformers from Captions Alone"☆34Updated 2 years ago
- Official implementation for paper "Relational Surrogate Loss Learning", ICLR 2022☆37Updated 2 years ago
- CVPR 2022 (Oral) Pytorch Code for Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment☆22Updated 3 years ago
- Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning☆11Updated 4 months ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Updated 3 years ago
- ☆51Updated 4 years ago
- Codes for DATA: Differentiable ArchiTecture Approximation.☆11Updated 3 years ago
- Phrase Localization Evaluation Toolkit☆20Updated 5 years ago
- Rethinking Nearest Neighbors for Visual Classification☆31Updated 3 years ago
- UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning☆70Updated 3 years ago
- FlatNCE: A Novel Contrastive Representation Learning Objective☆90Updated 3 years ago
- ☆37Updated 2 years ago
- Starter code for the VMT task and challenge☆51Updated 4 years ago
- WuDaoMM this is a data project☆73Updated 3 years ago
- Product1M☆87Updated 2 years ago
- Official code for the paper "Self-Distillation for Few-Shot Image Captioning"☆14Updated 4 years ago