yousefkotp / Visual-Question-Answering

A Light weight deep learning model with with a web application to answer image-based questions with a non-generative approach for the VizWiz grand challenge 2023 by carefully curating the answer vocabulary and adding linear layer on top of Open AI's CLIP model as image and text encoder
10Updated last year

Alternatives and similar repositories for Visual-Question-Answering:

Users that are interested in Visual-Question-Answering are comparing it to the libraries listed below