yousefkotp / Visual-Question-AnsweringLinks
A Light weight deep learning model with with a web application to answer image-based questions with a non-generative approach for the VizWiz grand challenge 2023 by carefully curating the answer vocabulary and adding linear layer on top of Open AI's CLIP model as image and text encoder
☆12Updated 2 years ago
Alternatives and similar repositories for Visual-Question-Answering
Users that are interested in Visual-Question-Answering are comparing it to the libraries listed below
Sorting:
- Pytorch implementation of image captioning using transformer-based model.☆66Updated 2 years ago
- [WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"☆80Updated 5 months ago
- Pytorch implementation of VQA: Visual Question Answering (https://arxiv.org/pdf/1505.00468.pdf) using VQA v2.0 dataset for open-ended ta…☆20Updated 5 years ago
- 【AAAI 2024】An Empirical Study of CLIP for Text-based Person Search☆67Updated last year
- Few-shot Object Counting and Detection (ECCV 2022)☆75Updated 9 months ago
- Composed Person Retrieval (CPR) is a new cross-modal retrieval task that aims to identify individuals in large-scale person image databas…☆25Updated 2 months ago
- [TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.☆130Updated 6 months ago
- Awesome List of Vision Language Prompt Papers☆46Updated last year
- [ECCV'22] Official repository of paper titled "Class-agnostic Object Detection with Multi-modal Transformer".☆312Updated 2 years ago
- FInetuning CLIP for Few Shot Learning☆44Updated 3 years ago
- [ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features☆181Updated last year
- GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)☆195Updated 2 years ago
- ICCV 2023: CLIPN for Zero-Shot OOD Detection: Teaching CLIP to Say No☆139Updated last year
- code for studying OpenAI's CLIP explainability☆33Updated 3 years ago
- [CVPR2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detect…☆58Updated 4 months ago
- A curated list of Computer Vision related conferences with dates and paper registration deadlines.☆35Updated last month