zeyofu / TARA
☆16Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for TARA
- Official repository for the A-OKVQA dataset☆64Updated 6 months ago
- Source code and data for Things not Written in Text: Exploring Spatial Commonsense from Visual Signals (ACL2022 main conference paper).☆20Updated 2 years ago
- ☆15Updated 2 years ago
- ☆63Updated 5 years ago
- PyTorch implementation of "Debiased Visual Question Answering from Feature and Sample Perspectives" (NeurIPS 2021)☆22Updated 2 years ago
- ☆47Updated 4 months ago
- ☆68Updated last year
- ☆32Updated last year
- The SVO-Probes Dataset for Verb Understanding☆31Updated 2 years ago
- EMNLP2023 - InfoSeek: A New VQA Benchmark focus on Visual Info-Seeking Questions☆16Updated 5 months ago
- M-HalDetect Dataset Release☆19Updated last year
- ICCV 2023 (Oral) Open-domain Visual Entity Recognition Towards Recognizing Millions of Wikipedia Entities☆33Updated 2 months ago
- Counterfactual Reasoning VQA Dataset☆24Updated last year
- my commonly-used tools☆47Updated 3 months ago
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022☆30Updated last year
- This repo contains codes and instructions for baselines in the VLUE benchmark.☆41Updated 2 years ago
- ☆83Updated 2 years ago
- [EMNLP 2022] TaCube: Pre-computing Data Cubes for Answering Numerical-Reasoning Questions over Tabular Data☆17Updated last year
- ☆28Updated last year
- ☆25Updated 2 weeks ago
- Official Repository for CVPR 2022 paper "REX: Reasoning-aware and Grounded Explanation"☆18Updated last year
- Codebase for AAAI 2024 conference paper Visual Chain-of-Thought Prompting for Knowledge-based Visual Reasoning☆18Updated 4 months ago
- ☆46Updated last month
- MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU☆41Updated last year
- NewsCLIPpings: Automatic Generation of Out-of-Context Multimodal Media, EMNLP 2021☆34Updated 2 months ago
- [EMNLP 2024] Multi-modal reasoning problems via code generation.☆17Updated 2 months ago
- [TACL'23] VSR: A probing benchmark for spatial undersranding of vision-language models.☆107Updated last year
- [NeurIPS 2023] A faithful benchmark for vision-language compositionality☆70Updated 9 months ago
- [Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning☆74Updated 6 months ago
- A Survey on the Honesty of Large Language Models☆47Updated last month