Cloud-CV/vilbert-multi-task

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Cloud-CV/vilbert-multi-task)

Cloud-CV / vilbert-multi-task

12-in-1: Multi-Task Vision and Language Representation Learning Web Demo

☆35

Alternatives and similar repositories for vilbert-multi-task

Users that are interested in vilbert-multi-task are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

prdwb / okvqa-release
View on GitHub
☆15May 10, 2021Updated 5 years ago
yashkant / concat-vqa
View on GitHub
Official code for the paper "Contrast and Classify: Training Robust VQA Models" published at ICCV, 2021
☆19Jul 27, 2021Updated 4 years ago
aditya-AI / Information-Retrieval-System-using-BERT
View on GitHub
☆15Feb 5, 2019Updated 7 years ago
guoyang9 / UnifER
View on GitHub
Official implementation for the MM'22 paper.
☆14Jun 30, 2022Updated 4 years ago
wangzheallen / STL-VQA
View on GitHub
The good practice in the VQA system such as pos-tag attention, structed triplet learning and triplet attention is very general and can be…
☆19Jan 23, 2018Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
BierOne / relation-vqa
View on GitHub
Re-implementation for 'R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering'.
☆12Mar 13, 2026Updated 4 months ago
oncescuandreea / QuerYD_downloader
View on GitHub
☆23Dec 5, 2023Updated 2 years ago
sanket0211 / WK-VQA
View on GitHub
World Knowledge Based Visual Question Answering
☆22Nov 26, 2020Updated 5 years ago
rowanz / merlot
View on GitHub
MERLOT: Multimodal Neural Script Knowledge Models
☆226Mar 15, 2022Updated 4 years ago
berniebear / Multi-HT100M
View on GitHub
☆53Dec 6, 2021Updated 4 years ago
zongshenmu / attention_knowledge_vqa
View on GitHub
vqa drived by bottom-up and top-down attention and knowledge
☆14Nov 21, 2018Updated 7 years ago
facebookresearch / vilbert-multi-task
View on GitHub
Multi Task Vision and Language
☆824Feb 16, 2022Updated 4 years ago
salesforce / VD-BERT
View on GitHub
☆45Jun 16, 2025Updated last year
alibabadoufu / dynamic_fusion_reimplementation
View on GitHub
Unofficial reimplementation of Dynamic Fusion with Intra- and Inter-modality Attention Flow for Visual Question Answering
☆17Oct 30, 2019Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
sergiotasconmorales / consistency_vqa
View on GitHub
Repository of paper Consistency-preserving Visual Question Answering in Medical Imaging (MICCAI2022)
☆26Mar 28, 2023Updated 3 years ago
zhegan27 / VILLA
View on GitHub
Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": UNITER…
☆119Jan 13, 2021Updated 5 years ago
zjuchenlong / faster-rcnn.pytorch
View on GitHub
fork from https://github.com/jwyang/faster-rcnn.pytorch
☆10Aug 6, 2018Updated 7 years ago
google-research-datasets / Crisscrossed-Captions
View on GitHub
Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO
☆54Sep 3, 2020Updated 5 years ago
universome / firelab
View on GitHub
Experimental framework for running pytorch experiments
☆14Mar 6, 2023Updated 3 years ago
jhuang81 / weak-sup-visual-grounding
View on GitHub
The official implementation of CVPR 2021 Paper: Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation.
☆12Oct 15, 2021Updated 4 years ago
facebookresearch / ProcedureVRL
View on GitHub
[CVPR 2023] Official code for "Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations"
☆56Aug 8, 2023Updated 2 years ago
swapnil96 / Background-Subtraction
View on GitHub
GMM model for subtracting background from foreground
☆10Nov 10, 2020Updated 5 years ago
blengerich / explainable-cnn
View on GitHub
Towards Visual Explanations for Convolutional Neural Networks via Input Resampling
☆13Aug 16, 2017Updated 8 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
keithnoguchi / do-in-action
View on GitHub
DO with Terraform and Ansible
☆11Jun 5, 2018Updated 8 years ago
yikuan8 / Transformers-VQA
View on GitHub
An implementation that downstreams pre-trained V+L models to VQA tasks. Now support: VisualBERT, LXMERT, and UNITER
☆165Dec 11, 2022Updated 3 years ago
giannisnik / k-hop-gnns
View on GitHub
k-hop Graph Neural Networks
☆19Jul 17, 2020Updated 6 years ago
YirongMao / COSONet
View on GitHub
The source code for the paper: Yirong Mao, Ruiping Wang, Shiguang Shan, Xilin Chen. COSONet: Compact Second-Order Network for Video Face …
☆12Dec 27, 2018Updated 7 years ago
Anup-Deshmukh / TREC_background_linking
View on GitHub
IR-BERT at TREC 2020: Leveraging BERT for Semantic Search in Background Linking
☆15Feb 21, 2022Updated 4 years ago
NeverMoreLCH / Awesome-VQA
View on GitHub
A reading list of papers about Visual Question Answering.
☆35Aug 17, 2022Updated 3 years ago
AranKomat / Metroplex
View on GitHub
☆21Mar 15, 2023Updated 3 years ago
China-UK-ZSL / ZS-F-VQA
View on GitHub
[Paper][ISWC 2021] Zero-shot Visual Question Answering using Knowledge Graph
☆72Feb 9, 2024Updated 2 years ago
UCSB-AI / CPL
View on GitHub
Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"
☆35Dec 5, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
criteo / fromconfig
View on GitHub
A library to instantiate any Python object from configuration files.
☆25Oct 12, 2022Updated 3 years ago
psaylor / spoke
View on GitHub
A framework for building speech-enabled websites.
☆10Jul 10, 2015Updated 11 years ago
2snoopy88 / GAT-with-batch
View on GitHub
implement gat with batch
☆10Nov 28, 2020Updated 5 years ago
digitalepidemiologylab / crowdbreaks-paper
View on GitHub
Material related to paper "Crowdbreaks: Tracking Health Trends using Public Social Media Data and Crowdsourcing"
☆12May 19, 2020Updated 6 years ago
cnzeki / margin-centre-face
View on GitHub
Face recognition
☆11Jun 20, 2019Updated 7 years ago
AasthaGupta / Twitter-Social-Graph
View on GitHub
Graphical Analysis of Twitter Social Media Community using Gephi. This is a part of Academic Project Report under the course Programming-…
☆16Apr 12, 2017Updated 9 years ago
GeorgeKyriakides / nord
View on GitHub
Deep neural architecture research framework
☆12Mar 24, 2023Updated 3 years ago