Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"
☆19Oct 4, 2022Updated 3 years ago
Alternatives and similar repositories for pointingqa
Users that are interested in pointingqa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A pytorch implemetation of data augmentation method for visual question answering☆21May 25, 2023Updated 2 years ago
- ☆37Oct 7, 2023Updated 2 years ago
- PyTorch implementation of "Debiased Visual Question Answering from Feature and Sample Perspectives" (NeurIPS 2021)☆27Oct 13, 2022Updated 3 years ago
- ☆27Jul 20, 2024Updated last year
- ☆17Feb 22, 2024Updated 2 years ago
- MLPs for Vision and Langauge Modeling (Coming Soon)☆27Dec 9, 2021Updated 4 years ago
- ☆27Mar 21, 2024Updated 2 years ago
- A spoken version of the textual story cloze benchmark☆20Aug 6, 2023Updated 2 years ago
- implementation for Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering☆10Mar 17, 2022Updated 4 years ago
- 基于 React + router + redux + axios 和 Flask + MySQL + Pytorch 的视觉问答管理系统☆10Dec 12, 2022Updated 3 years ago
- [CVPR2025] Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆109May 29, 2025Updated 9 months ago
- Code for T-MARS data filtering☆35Aug 23, 2023Updated 2 years ago
- ☆10Sep 12, 2024Updated last year
- LLM evaluation.☆16Nov 7, 2023Updated 2 years ago
- VisualGPTScore for visio-linguistic reasoning☆27Oct 7, 2023Updated 2 years ago
- Visual Question Answering Paper List.☆53Aug 19, 2022Updated 3 years ago
- Code for the experiments in the ACL 2020 paper "Estimating predictive uncertainty for rumour verification models"☆11May 15, 2020Updated 5 years ago
- GQA-OOD is a new dataset and benchmark for the evaluation of VQA models in OOD (out of distribution) settings.☆32Mar 1, 2021Updated 5 years ago
- Differentiable First-Order Logic Reasoning for Visual Question Answering☆44Mar 7, 2021Updated 5 years ago
- ☆12Jan 10, 2025Updated last year
- Tukey-Inspired Video Object Segmentation☆19Apr 9, 2025Updated 11 months ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 4 months ago
- The project is an official implementation of our paper " RSGNet: Relation based Skeleton Graph Network for Crowded Scenes Pose Estimation…☆10Dec 9, 2020Updated 5 years ago
- Authors's code for "Variational Causal Inference Network for Explanatory Visual Question Answering" and "Integrating Neural-Symbolic Reas…☆12Jun 27, 2025Updated 8 months ago
- Code and models for our paper "Risk-Aware Machine Learning Classifier for Skin Lesion Diagnosis"☆10Aug 2, 2024Updated last year
- ☆12Jun 17, 2020Updated 5 years ago
- Official implementation for the MM'22 paper.☆14Jun 30, 2022Updated 3 years ago
- ☆24May 28, 2023Updated 2 years ago
- ☆16Feb 12, 2026Updated last month
- Relation Networks for CLEVR implemented in PyTorch☆61Jun 11, 2018Updated 7 years ago
- multimodal video-audio-text generation and retrieval between every pair of modalities on the MUGEN dataset. The repo. contains the traini…☆40Apr 1, 2023Updated 2 years ago
- Code use to create COCO Attributes dataset and experiments in the associate ECCV 2016 paper.☆49Dec 26, 2022Updated 3 years ago
- ☆24Jun 18, 2025Updated 9 months ago
- Code release for "Understanding Bias in Large-Scale Visual Datasets"☆23Dec 4, 2024Updated last year
- ☆14May 10, 2021Updated 4 years ago
- ☆14Jun 29, 2024Updated last year
- ☆12Dec 16, 2020Updated 5 years ago
- Explaining Autonomous Driving Actions with Visual Question Answering (IEEE ITSC-2023)☆19Feb 15, 2024Updated 2 years ago
- Rationale-enhanced language models are better continual relation learners (EMNLP 2023 Main Conference)☆12Oct 11, 2023Updated 2 years ago