princetonvisualai/pointingqa

Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"

☆19

Alternatives and similar repositories for pointingqa

Users who are interested in pointingqa are comparing it to the libraries listed below.

yuleiniu / introd
[NeurIPS 2021] Introspective Distillation for Robust Question Answering
☆13 · Dec 7, 2021 · Updated 4 years ago
allenai / aokvqa
Official repository for the A-OKVQA dataset
☆117 · May 8, 2024 · Updated 2 years ago
zaynmi / seada-vqa
A PyTorch implementation of a data augmentation method for visual question answering
☆21 · May 25, 2023 · Updated 3 years ago
yonatanbitton / wysiwyr
☆37 · Oct 7, 2023 · Updated 2 years ago
Zhiquan-Wen / D-VQA
PyTorch implementation of "Debiased Visual Question Answering from Feature and Sample Perspectives" (NeurIPS 2021)
☆26 · Oct 13, 2022 · Updated 3 years ago
archiki / RepARe
☆21 · Oct 10, 2023 · Updated 2 years ago
rohan598 / ConTextual
☆27 · Jul 20, 2024 · Updated 2 years ago
lupantech / IconQA
Data and code for the NeurIPS 2021 paper "IconQA: A New Benchmark for Abstract Diagram Understanding and Visual Language Reasoning".
☆55 · Jan 28, 2024 · Updated 2 years ago
AndersonStra / Mucko
Implementation of Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering
☆10 · Mar 17, 2022 · Updated 4 years ago
Gary-code / KECVQG
[ACM MM 2023] The released code of the paper "Deconfounded Visual Question Generation with Causal Inference"
☆10 · Sep 3, 2024 · Updated last year
mugen-org / MUGEN_coinrun
A repository for the updated version of CoinRun used to collect MUGEN, a multimodal video-audio-text dataset. This repo contains scripts …
☆13 · Jul 13, 2022 · Updated 4 years ago
minrq / CGAN_Text2Video
Code for our IJCAI 2019 paper entitled "Conditional GAN with Discriminative Filter Generation for Text-to-Video Synthesis"
☆14 · Mar 29, 2022 · Updated 4 years ago
wusize / F-LMM
[CVPR 2025] Code release of F-LMM: Grounding Frozen Large Multimodal Models
☆115 · May 29, 2025 · Updated last year
locuslab / T-MARS
Code for T-MARS data filtering
☆35 · Aug 23, 2023 · Updated 2 years ago
yonseivnl / cmota
☆10 · Sep 12, 2024 · Updated last year
linzhiqiu / visual_gpt_score
VisualGPTScore for visio-linguistic reasoning
☆27 · Oct 7, 2023 · Updated 2 years ago
kevinyaobytedance / llm_eval
LLM evaluation.
☆16 · Nov 7, 2023 · Updated 2 years ago
gqa-ood / GQA-OOD
GQA-OOD is a new dataset and benchmark for the evaluation of VQA models in OOD (out-of-distribution) settings.
☆33 · Mar 1, 2021 · Updated 5 years ago
ZhangYuanhan-AI / OmniBenchmark
[ECCV 2022] New benchmark for evaluating pre-trained models; new supervised contrastive learning framework.
☆110 · Dec 8, 2023 · Updated 2 years ago
Adam1679 / mutan-article-net
Implementation of Mutan+ArticleNet on OKVQA
☆10 · Jan 11, 2021 · Updated 5 years ago
scwangdyd / large_vocabulary_hoi_detection
Code for ICCV 2021: Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detection
☆28 · Oct 12, 2021 · Updated 4 years ago
Taaccoo / awesome-vqa-latest
Visual Question Answering Paper List.
☆52 · Aug 19, 2022 · Updated 3 years ago
zhaohengyuan1 / SCT
(IJCV 2023) Official implementation of "SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels"
☆13 · Mar 20, 2025 · Updated last year
microsoft / DFOL-VQA
Differentiable First-Order Logic Reasoning for Visual Question Answering
☆45 · Mar 7, 2021 · Updated 5 years ago
LivXue / VCIN
Authors' code for "Variational Causal Inference Network for Explanatory Visual Question Answering" and "Integrating Neural-Symbolic Reas…
☆13 · Apr 13, 2026 · Updated 3 months ago
hula-ai / skin_lesion_uncertainty_estimation
Code and models for our paper "Risk-Aware Machine Learning Classifier for Skin Lesion Diagnosis"
☆10 · Aug 2, 2024 · Updated last year
AgarwalVedika / CausalVQA
☆12 · Jun 17, 2020 · Updated 6 years ago
mugen-org / MUGEN_baseline
Multimodal video-audio-text generation and retrieval between every pair of modalities on the MUGEN dataset. The repo contains the traini…
☆42 · Apr 1, 2023 · Updated 3 years ago
rohandkn / skribble2vid
☆24 · May 28, 2023 · Updated 3 years ago
guoyang9 / UnifER
Official implementation for the MM'22 paper.
☆14 · Jun 30, 2022 · Updated 4 years ago
alvations / expletives
Expletives vomiting library...
☆13 · Apr 18, 2026 · Updated 3 months ago
boyazeng / understand_bias
Code release for "Understanding Bias in Large-Scale Visual Datasets"
☆25 · Dec 4, 2024 · Updated last year
aurooj / MMFT-BERT
☆14 · Jun 29, 2024 · Updated 2 years ago
SivanDoveh / DAC
Repository for the paper "Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models"
☆28 · Nov 29, 2023 · Updated 2 years ago
SY-Xuan / vibe_python
An implementation of vibe in Python
☆11 · Jul 27, 2018 · Updated 7 years ago
cdancette / rubi.bootstrap.pytorch
NeurIPS 2019 paper: "RUBi: Reducing Unimodal Biases for Visual Question Answering"
☆66 · Mar 29, 2021 · Updated 5 years ago
slp-rl / SpokenStoryCloze
A spoken version of the textual story cloze benchmark
☆22 · Aug 6, 2023 · Updated 2 years ago
virtualgraham / sc_patch
☆12 · Dec 16, 2020 · Updated 5 years ago
googleapis / google-cloud-php-ai-platform
☆23 · Jul 13, 2026 · Updated last week