[EMNLP 2021] Code and data for our paper "Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers"
☆20Jan 17, 2022Updated 4 years ago
Alternatives and similar repositories for cross-modal-ablation
Users who are interested in cross-modal-ablation are comparing it to the libraries listed below
- [TACL 2021] Code and data for the framework in "Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-La…☆114Mar 24, 2022Updated 3 years ago
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)☆34Feb 5, 2023Updated 3 years ago
- PyTorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)☆17Jan 12, 2023Updated 3 years ago
- Controllable image captioning model with unsupervised modes☆21Apr 14, 2023Updated 2 years ago
- [CVPRW'23] The official PyTorch implementation of NamedMask☆23Jun 12, 2023Updated 2 years ago
- [EMNLP 2021] Code and data for our paper "Visually Grounded Reasoning across Languages and Cultures"☆30Dec 30, 2021Updated 4 years ago
- MERLOT: Multimodal Neural Script Knowledge Models☆225Mar 15, 2022Updated 3 years ago
- Official implementation of paper "ScatSimCLR: self-supervised contrastive learning with pretext task regularization for small-scale datas…☆26Sep 7, 2021Updated 4 years ago
- Prompt-learning methods using BERT4Keras (PET, EFL and NSP-BERT), for both Chinese and English.☆30Oct 12, 2022Updated 3 years ago
- Adaptive Class Suppression Loss for Long-Tail Object Detection--CVPR2021☆81Apr 6, 2021Updated 4 years ago
- PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)☆209Dec 18, 2022Updated 3 years ago
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆23Nov 13, 2025Updated 3 months ago
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022☆11Apr 13, 2025Updated 10 months ago
- Partially Non-Autoregressive Image Captioning☆10Sep 30, 2021Updated 4 years ago
- ☆16Feb 22, 2025Updated last year
- ☆14Mar 20, 2025Updated 11 months ago
- ☆10Nov 17, 2022Updated 3 years ago
- A comprehensive framework to explore whether embodied multimodal models are plausibly resilient☆13Nov 19, 2025Updated 3 months ago
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Feb 15, 2024Updated 2 years ago
- The code for COPACRR Neural IR model.☆37Feb 6, 2018Updated 8 years ago
- official repo for `thinking with images through-self-calling`☆20Dec 28, 2025Updated 2 months ago
- A Simple Framework for CV Pre-training Model (SOCO, VirTex, BEiT)☆15Oct 18, 2021Updated 4 years ago
- Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes☆10Jul 10, 2024Updated last year
- A PyTorch implementation of Knowledge Graph Embedding by Normalizing Flows.☆10Nov 22, 2022Updated 3 years ago
- ☆10Sep 14, 2022Updated 3 years ago
- ☆11Apr 30, 2015Updated 10 years ago
- ☆11Nov 23, 2020Updated 5 years ago
- Temporal Random Indexing☆14Oct 3, 2024Updated last year
- Semi-Markov Afterstate Actor-Critic (SMAAC) with Maze☆11Nov 16, 2021Updated 4 years ago
- Text-to-video generation.☆10Jul 22, 2022Updated 3 years ago
- Tiny evaluation of leading LLMs on competitive programming problems☆14Nov 28, 2024Updated last year
- Official Repository for CLRCMD (Appears in ACL 2022)☆43Feb 21, 2023Updated 3 years ago
- Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021)☆12Oct 25, 2021Updated 4 years ago
- ☆13Nov 28, 2025Updated 3 months ago
- The implementation of LLMTreeRec☆14Dec 9, 2024Updated last year
- ☆12Dec 9, 2025Updated 2 months ago
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆11Apr 18, 2025Updated 10 months ago
- VideoGPA is a self-supervised framework that enhances 3D consistency in Video Diffusion Models.☆35Feb 3, 2026Updated last month
- 3D Scene Flow Estimation☆14Sep 24, 2025Updated 5 months ago