salesforce/VD-BERT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/salesforce/VD-BERT)

salesforce / VD-BERT

☆45

Alternatives and similar repositories for VD-BERT

Users that are interested in VD-BERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

idansc / mrr-ndcg
View on GitHub
☆18Jun 10, 2024Updated 2 years ago
vmurahari3 / visdial-bert
View on GitHub
Implementation for "Large-scale Pretraining for Visual Dialog" https://arxiv.org/abs/1912.02379
☆95Mar 31, 2020Updated 6 years ago
gicheonkang / dan-visdial
View on GitHub
✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog"
☆44Mar 19, 2023Updated 3 years ago
HKUST-KnowComp / VD-PCR
View on GitHub
Source code for paper "VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution"
☆10Nov 1, 2022Updated 3 years ago
wh0330 / CAG_VisDial
View on GitHub
☆15Aug 13, 2020Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
HKUST-KnowComp / Visual_PCR
View on GitHub
Dataset and Source code for EMNLP 2019 paper "What You See is What You Get: Visual Pronoun Coreference Resolution in Dialogues"
☆26Sep 10, 2021Updated 4 years ago
gicheonkang / sglkt-visdial
View on GitHub
🌈 PyTorch Implementation for EMNLP'21 Findings "Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer"
☆13Feb 1, 2023Updated 3 years ago
jokieleung / Maria
View on GitHub
PyTorch implementation for ACL 2021 paper "Maria: A Visual Experience Powered Conversational Agent".
☆23Sep 19, 2021Updated 4 years ago
shubhamagarwal92 / visdial_conv
View on GitHub
This repository contains code used in our ACL'20 paper History for Visual Dialog: Do we really need it?
☆33Mar 24, 2023Updated 3 years ago
ZihaoW123 / UniMM
View on GitHub
Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"
☆13May 12, 2023Updated 3 years ago
prdwb / okvqa-release
View on GitHub
☆15May 10, 2021Updated 5 years ago
SpencerWhitehead / novelvqa
View on GitHub
☆27Oct 7, 2021Updated 4 years ago
yuewang-cuhk / CMKP
View on GitHub
Official code and data for EMNLP 2020 paper "Cross-Media Keyphrase Prediction: A Unified Framework with Multi-Modality Multi-Head Attenti…
☆21Nov 27, 2020Updated 5 years ago
jlian2 / mucko
View on GitHub
Pytorch Implementation of MUCKO(2020 IJCAI)
☆20Oct 25, 2020Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
vmurahari3 / visdial-diversity
View on GitHub
Pytorch implementation of https://arxiv.org/pdf/1909.10470.pdf
☆32Aug 23, 2021Updated 4 years ago
phellonchen / awesome-visual-dialog
View on GitHub
Recent Advances in Visual Dialog
☆28Aug 19, 2022Updated 3 years ago
quangvnai / visdial
View on GitHub
Visual Dialog: Light-weight Transformer for Many Inputs (ECCV 2020)
☆29Aug 5, 2021Updated 4 years ago
JXZe / DualVD
View on GitHub
☆77Nov 22, 2022Updated 3 years ago
taesunwhang / MVAN-VisDial
View on GitHub
PyTorch Implementation of Multi-View Attention Networks for Visual Dialog
☆43Mar 24, 2023Updated 3 years ago
facebookresearch / corefnmn
View on GitHub
Visual Coreference Resolution in Visual Dialog using Neural Module Networks
☆58Oct 12, 2021Updated 4 years ago
luomancs / retriever_reader_for_okvqa
View on GitHub
☆19Dec 8, 2022Updated 3 years ago
batra-mlp-lab / visdial-challenge-starter-pytorch
View on GitHub
Starter code in PyTorch for the Visual Dialog challenge
☆188Mar 24, 2023Updated 3 years ago
neulab / ToM-Language-Acquisition
View on GitHub
Code used to run experiments for the ICLR 2023 paper "Computational Language Acquisition with Theory of Mind".
☆15Apr 27, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
jialinwu17 / MAVEX
View on GitHub
☆30Dec 16, 2022Updated 3 years ago
yuleiniu / rva
View on GitHub
Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"
☆64Mar 24, 2023Updated 3 years ago
ItemZheng / KDDAug
View on GitHub
[ECCV2022] Rethinking Data Augmentation for Robust Visual Question Answering
☆13Nov 23, 2022Updated 3 years ago
yanxinzju / CSS-VQA
View on GitHub
Counterfactual Samples Synthesizing for Robust VQA
☆78Nov 24, 2022Updated 3 years ago
yashkant / concat-vqa
View on GitHub
Official code for the paper "Contrast and Classify: Training Robust VQA Models" published at ICCV, 2021
☆19Jul 27, 2021Updated 4 years ago
henryhungle / MTN
View on GitHub
Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19)
☆100Oct 17, 2022Updated 3 years ago
zhegan27 / VILLA
View on GitHub
Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": UNITER…
☆119Jan 13, 2021Updated 5 years ago
jiasenlu / bottom-up-attention
View on GitHub
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
☆23Aug 22, 2019Updated 6 years ago
sairin1202 / Commonsense-Knowledge-Aware-Concept-Selection-For-Diverse-and-Informative-Visual-Storytelling
View on GitHub
The implement of Commonsense Knowledge Aware Concept Selection For Diverse and Informative Visual Storytelling
☆12Aug 19, 2021Updated 4 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
cdancette / rubi.bootstrap.pytorch
View on GitHub
NeurIPS 2019 Paper: RUBi : Reducing Unimodal Biases for Visual Question Answering
☆66Mar 29, 2021Updated 5 years ago
microsoft / Unicoder
View on GitHub
Unicoder model for understanding and generation.
☆91Dec 12, 2023Updated 2 years ago
yangxuntu / catt
View on GitHub
☆12Mar 8, 2021Updated 5 years ago
idansc / fga
View on GitHub
☆30Oct 20, 2021Updated 4 years ago
yuleiniu / cfvqa
View on GitHub
[CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias
☆136Dec 15, 2021Updated 4 years ago
gicheonkang / gst-visdial
View on GitHub
Official PyTorch Implementation for CVPR'23 Paper, "The Dialog Must Go On: Improving Visual Dialog via Generative Self-Training"
☆20Dec 11, 2023Updated 2 years ago
coldmanck / RVL-BERT
View on GitHub
The official code for "Visual Relationship Detection with Visual-Linguistic Knowledge from Multimodal Representations" (IEEE Access, 2021…
☆18Oct 21, 2022Updated 3 years ago