[CVPR 2024 CVinW] Multi-Agent VQA: Exploring Multi-Agent Foundation Models on Zero-Shot Visual Question Answering
☆22Sep 21, 2024Updated last year
Alternatives and similar repositories for Multi-Agent-VQA
Users that are interested in Multi-Agent-VQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [WACV 2025] Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge☆40Oct 29, 2024Updated last year
- ☆17Dec 13, 2023Updated 2 years ago
- ☆13Mar 14, 2025Updated last year
- [CVPR'2022 Oral] The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph Generation☆32Oct 19, 2023Updated 2 years ago
- VQACL: A Novel Visual Question Answering Continual Learning Setting (CVPR'23)☆45Mar 28, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code and Dataset for our CVPR 2022 paper "Video Shadow Detection via Spatio-Temporal Interpolation Consistency Training"☆12Jul 8, 2022Updated 3 years ago
- B站爬虫☆15Dec 10, 2023Updated 2 years ago
- Scene text rectification using glyph and character alignment properties☆22Jan 21, 2018Updated 8 years ago
- A generative model of compositionality in symmetric monoidal (Kleisli) categories☆12Oct 4, 2023Updated 2 years ago
- Weakly-Supervised Cell Tracking via Backward-and-Forward Propagation, in ECCV 2020☆11Aug 4, 2020Updated 5 years ago
- ☆20Apr 22, 2022Updated 4 years ago
- A test for RL application on f1tenth gym environment☆11Apr 10, 2023Updated 3 years ago
- CorrNet+: Sign Language Recognition and Translation via Spatial-Temporal Correlation☆37Jan 29, 2025Updated last year
- InstAttention: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference☆17Mar 30, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Estimate dataset difficulty and detect label mistakes using reconstruction error ratios!☆28Jan 10, 2025Updated last year
- ☆12Sep 8, 2020Updated 5 years ago
- Counterfactual Reasoning VQA Dataset☆28Nov 23, 2023Updated 2 years ago
- The good practice in the VQA system such as pos-tag attention, structed triplet learning and triplet attention is very general and can be…☆19Jan 23, 2018Updated 8 years ago
- This is the official repository for the paper "Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World"…☆49Mar 12, 2024Updated 2 years ago
- Official implementation of TagAlign☆37Dec 11, 2024Updated last year
- Python scripts for tracking cells in fluorescent microscopy.☆11Dec 10, 2017Updated 8 years ago
- Coarse-to-Fine Reasoning for Visual Question Answering (CVPRW'22)☆49Apr 22, 2026Updated last month
- Official implementation of "MedITok: A Unified Tokenizer for Medical Image Synthesis and Interpretation"☆28Apr 3, 2026Updated last month
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆11Dec 20, 2024Updated last year
- PyTorch implementation of the paper "SuperLoss: A Generic Loss for Robust Curriculum Learning" in NIPS 2020.☆29Jan 26, 2021Updated 5 years ago
- Weak conditional diffusion for domain adaptation☆10Nov 4, 2024Updated last year
- [CVPR 2025] COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training☆41Mar 27, 2025Updated last year
- ☆10Aug 22, 2020Updated 5 years ago
- Explaining Autonomous Driving Actions with Visual Question Answering (IEEE ITSC-2023)☆19Feb 15, 2024Updated 2 years ago
- ☆11Jun 21, 2025Updated 11 months ago
- This code was submitted to Cell Tracking Challenge, ISBI 2020.☆14May 19, 2021Updated 5 years ago
- [NeurIPS 25] Hyperbolic Contrastive Regularisation for Geometrically Aware Sign Language Translation☆25Nov 26, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for paper "Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation"☆39Apr 8, 2026Updated last month
- Code for ACL22 short Paper "Hierarchical Curriculum Learning for AMR Parsing"☆13Jun 1, 2022Updated 3 years ago
- ☆12Mar 8, 2021Updated 5 years ago
- A Deep Learning-Based Smartphone App for Real-Time Detection of Retinal Abnormalities in Fundus Images☆11Mar 11, 2020Updated 6 years ago
- ☆15Nov 17, 2023Updated 2 years ago
- Visualization of the PCA as shown in Figure 1.☆45Jan 14, 2024Updated 2 years ago
- This repository is about our work "A Three-Stage Self-Training Framework for Semi-Supervised Semantic Segmentation"☆20Jul 4, 2022Updated 3 years ago