DeVLBert: Learning Deconfounded Visio-Linguistic Representations
☆27Nov 27, 2022Updated 3 years ago
Alternatives and similar repositories for DeVLBert
Users who are interested in DeVLBert are comparing it to the libraries listed below.
- Implementation of Mutan+ArticleNet on OKVQA☆10Jan 11, 2021Updated 5 years ago
- Comprehensive Information Integration Modeling Framework for Video Titling☆11Aug 27, 2020Updated 5 years ago
- Counterfactual Samples Synthesizing for Robust VQA☆79Nov 24, 2022Updated 3 years ago
- ☆15Aug 20, 2024Updated last year
- A pytorch implementation of D3Net.☆11Aug 8, 2021Updated 4 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- ☆18Dec 2, 2018Updated 7 years ago
- ☆15Nov 28, 2024Updated last year
- TallyQA: Answering Complex Counting Questions dataset☆29Feb 19, 2024Updated 2 years ago
- [CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias☆131Dec 15, 2021Updated 4 years ago
- Poet: Product-oriented Video Captioner for E-commerce☆12Sep 21, 2020Updated 5 years ago
- ☆13Feb 1, 2022Updated 4 years ago
- MMBERT: Multimodal BERT Pretraining for Improved Medical VQA☆39Mar 22, 2021Updated 5 years ago
- MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering☆100Mar 30, 2023Updated 2 years ago
- Voice Conversion using Tacotron.☆11Dec 29, 2022Updated 3 years ago
- Repository of paper Consistency-preserving Visual Question Answering in Medical Imaging (MICCAI2022)☆25Mar 28, 2023Updated 3 years ago
- ☆25May 12, 2025Updated 10 months ago
- [CVPR2026] VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice☆79Feb 27, 2026Updated last month
- Chain of Images for Intuitively Reasoning☆10Nov 29, 2023Updated 2 years ago
- Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"☆35Dec 5, 2022Updated 3 years ago
- Reimplementation of the paper `Human Attention Maps for Text Classification: Do Humans and Neural Networks Focus on the Same Words? (ACL2…☆17Jul 10, 2020Updated 5 years ago
- The good practice in the VQA system such as pos-tag attention, structed triplet learning and triplet attention is very general and can be…☆19Jan 23, 2018Updated 8 years ago
- Repository of proposal-free temporal moment localization work☆33Jun 11, 2024Updated last year
- [CVPR 2020] The official PyTorch implementation of "Visual Commonsense R-CNN"☆359May 2, 2021Updated 4 years ago
- The official code for "Visual Relationship Detection with Visual-Linguistic Knowledge from Multimodal Representations" (IEEE Access, 2021…☆18Oct 21, 2022Updated 3 years ago
- Codebase for LangNav paper☆19Jun 13, 2024Updated last year
- A comparison of human attention with computational attention mechanisms☆12Jul 3, 2020Updated 5 years ago
- EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System☆15Mar 31, 2019Updated 6 years ago
- ☆16Nov 14, 2018Updated 7 years ago
- List of Publications in Graph Contrastive Learning☆35May 5, 2022Updated 3 years ago
- A collection of graph contrastive learning methods.☆18Apr 1, 2022Updated 3 years ago
- TPSE-GST Tacotron2☆14May 1, 2019Updated 6 years ago
- ☆15Aug 4, 2025Updated 7 months ago
- Referring Expression Parser☆27Feb 10, 2018Updated 8 years ago
- Parkar and Kim et al.'s paper on "Can LLMs Select Important Instructions to Annotate?"☆13Jul 4, 2024Updated last year
- ☆12Jun 30, 2024Updated last year
- A reading list of papers about Visual Question Answering.☆35Aug 17, 2022Updated 3 years ago
- Bilateral Cross-Modality Graph Matching Attention for Feature Fusion in Visual Question Answering☆11Feb 16, 2023Updated 3 years ago
- baselines for DocVQA dataset☆21Apr 11, 2021Updated 4 years ago