tezansahu / VQA-With-Multimodal-Transformers

Exploring multimodal fusion-type transformer models for visual question answering (on DAQUAR dataset)
33Updated 2 years ago

Related projects: