[AAAI 24] Official Codebase for BridgeQA: Bridging the Gap between 2D and 3D Visual Question Answering: A Fusion Approach for 3D VQA
☆27Jul 12, 2024Updated last year
Alternatives and similar repositories for BridgeQA
Users that are interested in BridgeQA are comparing it to the libraries listed below
Sorting:
- ☆152Aug 23, 2023Updated 2 years ago
- ☆12May 19, 2025Updated 9 months ago
- official code for "3D Question Answering via only 2D Vision-Language Models"☆23Updated this week
- [NeurIPS 2024] MSR3D: Advanced Situated Reasoning in 3D Scenes☆70Dec 2, 2025Updated 3 months ago
- ☆14May 25, 2021Updated 4 years ago
- Open-Vocabulary SAM3D: Understand Any 3D Scene☆39Jun 9, 2025Updated 8 months ago
- ☆44Mar 27, 2023Updated 2 years ago
- [AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)☆41Mar 23, 2024Updated last year
- Official Code for All-in-One Medical Image Re-Identification (CVPR2025)☆18Jan 11, 2026Updated last month
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"☆84Aug 2, 2024Updated last year
- ☆12Dec 12, 2024Updated last year
- Official implementation of `Discovering Hidden Visual Concepts Beyond Linguistic Input in Infant Learning`, CVPR 2025☆13Aug 1, 2025Updated 7 months ago
- This is for the AI enzyme design course☆13Nov 10, 2025Updated 3 months ago
- Goal of this project is to build Classification Decision Trees and Regression Decision trees without using any Machine learning libraries☆10Dec 28, 2018Updated 7 years ago
- Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"☆278Mar 19, 2025Updated 11 months ago
- ☆12Jul 22, 2024Updated last year
- Human-centric environment representations from egocentric video☆14Feb 5, 2026Updated last month
- ☆15Jan 25, 2025Updated last year
- ☆12Dec 20, 2024Updated last year
- Python package to download and use the SSB datasets☆11Aug 3, 2023Updated 2 years ago
- Multi-task UNet for medical image classification and saliency prediction☆15Jan 29, 2022Updated 4 years ago
- ☆10May 4, 2018Updated 7 years ago
- ☆12Apr 24, 2024Updated last year
- A biological dual-language foundation model☆12Jun 16, 2025Updated 8 months ago
- Library for automatic time series forecasting based on ARIMA models☆12May 14, 2017Updated 8 years ago
- Code for ACL22 short Paper "Hierarchical Curriculum Learning for AMR Parsing"☆13Jun 1, 2022Updated 3 years ago
- Code base for publication: Reinforcement Learning Approach for Multi-Agent Flexible Scheduling Problems☆10Feb 1, 2023Updated 3 years ago
- Application of OpenAI tools such as Whisper, DALL-E, and ChatGPT to generate album covers from audio☆12May 31, 2023Updated 2 years ago
- [npj Digital Medicine] A multimodal multidomain multilingual medical foundation model for zero shot clinical diagnosis☆17Feb 6, 2025Updated last year
- Tracking (bio)medical imaging datasets☆19Apr 13, 2024Updated last year
- ☆12Jan 10, 2025Updated last year
- This repository provides the materials used in "Unsupervised Melody-to-Lyric Generation" by Yufei Tian, Anjali Narayan-Chen, Shereen Orab…☆11Jul 6, 2023Updated 2 years ago
- Groq-powered MAD: The first work to explore Multi-Agent Debate with Large Language Models :D☆12Jul 5, 2024Updated last year
- Official Repository for "Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality" (ECCV 2024)☆16Oct 29, 2024Updated last year
- ☆11Apr 30, 2022Updated 3 years ago
- ☆10Jul 23, 2021Updated 4 years ago
- Segment graph convolutional neural network for relation classification. Paper in JAMIA.☆10May 13, 2019Updated 6 years ago
- ☆12Jul 5, 2024Updated last year
- X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization, CVPR 2024☆11Nov 7, 2024Updated last year