alexandrosXe / A-Simple-Baseline-For-Knowledge-Based-VQA
Repo for the EMNLP 2023 paper "A Simple Baseline for Knowledge-Based Visual Question Answering"
☆25 · Dec 14, 2023 · Updated 2 years ago
Alternatives and similar repositories for A-Simple-Baseline-For-Knowledge-Based-VQA
Users who are interested in A-Simple-Baseline-For-Knowledge-Based-VQA are comparing it to the repositories listed below
- [CVPR'25] AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data · ☆17 · Mar 27, 2025 · Updated 10 months ago
- Local self-attention in Transformer for visual question answering · ☆13 · Mar 17, 2024 · Updated last year
- ☆11 · Jun 20, 2023 · Updated 2 years ago
- Code for the paper: A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question Answering · ☆13 · Aug 22, 2023 · Updated 2 years ago
- Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization · ☆16 · Jul 20, 2023 · Updated 2 years ago
- ☆69 · Jul 25, 2024 · Updated last year
- ☆17 · Dec 13, 2023 · Updated 2 years ago
- Official implementation for the MM'22 paper. · ☆14 · Jun 30, 2022 · Updated 3 years ago
- ☆18 · May 31, 2023 · Updated 2 years ago
- An automatic MLLM hallucination detection framework · ☆19 · Sep 26, 2023 · Updated 2 years ago
- Using image captions with LLM for zero-shot VQA · ☆18 · Mar 14, 2024 · Updated last year
- Official Repository for VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning · ☆25 · Apr 12, 2024 · Updated last year
- ☆20 · Sep 17, 2022 · Updated 3 years ago
- ICML23 "Latent Traversals in Generative Models as Potential Flows" · ☆27 · Oct 23, 2023 · Updated 2 years ago
- [CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention · ☆61 · Jul 16, 2024 · Updated last year
- [NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration · ☆26 · Oct 17, 2024 · Updated last year
- ASCL: Adaptive Soft Contrastive Learning (ICPR2022) · ☆22 · Mar 22, 2025 · Updated 10 months ago
- EMNLP2023 - InfoSeek: A New VQA Benchmark Focused on Visual Info-Seeking Questions · ☆25 · May 30, 2024 · Updated last year
- Counterfactual Reasoning VQA Dataset · ☆27 · Nov 23, 2023 · Updated 2 years ago
- [NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models · ☆34 · Nov 13, 2024 · Updated last year
- Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language" · ☆69 · Jul 11, 2022 · Updated 3 years ago
- HallE-Control: Controlling Object Hallucination in LMMs · ☆31 · Apr 10, 2024 · Updated last year
- Code for ICLR 2022 Paper, "Controlling Directions Orthogonal to a Classifier" · ☆35 · Jun 6, 2023 · Updated 2 years ago
- SSR: An Efficient and Robust Framework for Learning with Unknown Label Noise (BMVC2022) · ☆31 · Mar 22, 2025 · Updated 10 months ago
- The official implementation of "Cross-modal Causal Relation Alignment for Video Question Grounding. (CVPR 2025 Highlight)" · ☆42 · Apr 27, 2025 · Updated 9 months ago
- PyTorch implementation of our CVPR'23 paper DivClust: Controlling Diversity in Deep Clustering. · ☆31 · Aug 30, 2023 · Updated 2 years ago
- [ICME 2025 Oral] Official implementation of "GlanceVAD: Exploring Glance Supervision for Label-efficient Video Anomaly Detection" · ☆34 · Mar 23, 2025 · Updated 10 months ago
- Code for paper "Object landmark discovery through unsupervised adaptation" · ☆38 · Nov 14, 2019 · Updated 6 years ago
- ☆44 · Jan 21, 2025 · Updated last year
- Authors' official PyTorch implementation of "ContraCLIP: Interpretable GAN generation driven by pairs of contrasting sentences". · ☆42 · Oct 1, 2022 · Updated 3 years ago
- Natural language guided image captioning · ☆87 · Feb 11, 2024 · Updated 2 years ago
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model · ☆13 · Feb 15, 2024 · Updated 2 years ago
- [CVPR2025] Official code for Lost in Translation Found in Context · ☆23 · Jan 14, 2026 · Updated last month
- Fine-Grained Knowledge Fusion for Retrieval-Augmented Medical Visual Question · ☆11 · Jul 18, 2024 · Updated last year
- Implementation for the paper "Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly" (ECCV 2022: https//arxiv.org/abs… · ☆38 · May 19, 2023 · Updated 2 years ago
- Improving Continuous Sign Language Recognition with Adapted Image Models · ☆14 · Nov 10, 2025 · Updated 3 months ago
- ☆12 · Jun 26, 2024 · Updated last year
- Using LLMs and pre-trained caption models for super-human performance on image captioning. · ☆42 · Oct 13, 2023 · Updated 2 years ago
- Machine learning for malware detection · ☆11 · Aug 2, 2016 · Updated 9 years ago