tezansahu / VQA-With-Multimodal-Transformers

Exploring multimodal fusion-type transformer models for visual question answering (on the DAQUAR dataset)
34 · Updated 3 years ago

Alternatives and similar repositories for VQA-With-Multimodal-Transformers:

Users who are interested in VQA-With-Multimodal-Transformers are comparing it to the libraries listed below