[CVPR 2024 CVinW] Multi-Agent VQA: Exploring Multi-Agent Foundation Models on Zero-Shot Visual Question Answering
☆20Sep 21, 2024Updated last year
Alternatives and similar repositories for Multi-Agent-VQA
Users that are interested in Multi-Agent-VQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [WACV 2025] Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge☆40Oct 29, 2024Updated last year
- [CVPR 23] Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!☆17May 14, 2024Updated last year
- ☆17Dec 13, 2023Updated 2 years ago
- ☆13Mar 14, 2025Updated last year
- VQACL: A Novel Visual Question Answering Continual Learning Setting (CVPR'23)☆44Mar 28, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [CVPR'2022 Oral] The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph Generation☆33Oct 19, 2023Updated 2 years ago
- MICCAI 2022 MELA Challenge: Mediastinal Lesion Analysis (3D Detection)☆11Jun 30, 2022Updated 3 years ago
- B站爬虫☆15Dec 10, 2023Updated 2 years ago
- Weakly-Supervised Cell Tracking via Backward-and-Forward Propagation, in ECCV 2020☆11Aug 4, 2020Updated 5 years ago
- Offical Code of MICCAI'25 Best-Paper-Shortlist paper "MedGround-R1: Advancing Medical Image Grounding via Spatial-Semantic Rewarded Group…☆38Sep 28, 2025Updated 6 months ago
- ☆16Jan 23, 2018Updated 8 years ago
- [NeurIPS 2023] LMC: Large Model Collaboration with Cross-assessment for Training-Free Open-Set Object Recognition☆19May 26, 2024Updated last year
- A real-time video understanding foundation model built on Llama-3.2-Vision, featuring comprehensively extended video processing and multi…☆135Updated this week
- CorrNet+: Sign Language Recognition and Translation via Spatial-Temporal Correlation☆35Jan 29, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- InstAttention: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference☆16Mar 30, 2025Updated last year
- ☆12Sep 8, 2020Updated 5 years ago
- Counterfactual Reasoning VQA Dataset☆28Nov 23, 2023Updated 2 years ago
- The good practice in the VQA system such as pos-tag attention, structed triplet learning and triplet attention is very general and can be…☆19Jan 23, 2018Updated 8 years ago
- This is the official repository for the paper "Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World"…☆48Mar 12, 2024Updated 2 years ago
- Codebase for AAAI 2024 conference paper Visual Chain-of-Thought Prompting for Knowledge-based Visual Reasoning☆39Mar 12, 2025Updated last year
- Implementation of ResiDualGAN and DRDG☆14Apr 15, 2024Updated 2 years ago
- Official implementation of TagAlign☆37Dec 11, 2024Updated last year
- Python scripts for tracking cells in fluorescent microscopy.☆11Dec 10, 2017Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- PyTorch implementation of the paper "SuperLoss: A Generic Loss for Robust Curriculum Learning" in NIPS 2020.☆29Jan 26, 2021Updated 5 years ago
- A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper ``Linguistic Structures as Weak Supervision…☆37Apr 25, 2021Updated 4 years ago
- [CVPR 2025] COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training☆41Mar 27, 2025Updated last year
- ☆20Oct 22, 2024Updated last year
- [EMNLP 2024] TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering☆17Oct 31, 2024Updated last year
- ☆11Jun 21, 2025Updated 9 months ago
- This code was submitted to Cell Tracking Challenge, ISBI 2020.☆14May 19, 2021Updated 4 years ago
- Code for paper "Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation"☆40Apr 8, 2026Updated last week
- Code for ACL22 short Paper "Hierarchical Curriculum Learning for AMR Parsing"☆13Jun 1, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Visualization of the PCA as shown in Figure 1.☆44Jan 14, 2024Updated 2 years ago
- ☆12Mar 8, 2021Updated 5 years ago
- A Deep Learning-Based Smartphone App for Real-Time Detection of Retinal Abnormalities in Fundus Images☆11Mar 11, 2020Updated 6 years ago
- ☆15Nov 17, 2023Updated 2 years ago
- ☆12Jan 10, 2025Updated last year
- ☆17Apr 15, 2025Updated last year
- A GUI tool to interact with X-RAY, MRI and CT scans.☆17Apr 11, 2017Updated 9 years ago