mira-ai-lab / MUSIC-AVQA-R
☆13 · May 21, 2024 · Updated last year
Alternatives and similar repositories for MUSIC-AVQA-R
Users that are interested in MUSIC-AVQA-R are comparing it to the libraries listed below
- Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024. ☆16 · Oct 25, 2024 · Updated last year
- ACM MM 2022 paper_AVQA: A Dataset for Audio-Visual Question Answering on Videos ☆15 · Aug 17, 2023 · Updated 2 years ago
- MUSIC-AVQA, CVPR2022 (ORAL) ☆94 · Dec 30, 2022 · Updated 3 years ago
- Official implementation for CIGN ☆17 · Sep 11, 2023 · Updated 2 years ago
- Question-Aware Gaussian Experts for Audio-Visual Question Answering -- Official Pytorch Implementation (CVPR'25, Highlight) ☆26 · Jun 6, 2025 · Updated 8 months ago
- ☆36 · Jul 9, 2025 · Updated 7 months ago
- A codebase for data crawling and preprocessing for TTS and ASR systems training. ☆22 · Feb 5, 2026 · Updated last week
- The official implementation of our work CoSDA: Continual Source-Free Domain Adaptation. ☆44 · Aug 28, 2023 · Updated 2 years ago
- Pytorch implementation of "Test-time Adaption against Multi-modal Reliability Bias". ☆45 · Dec 24, 2024 · Updated last year
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos' ☆13 · Jun 16, 2024 · Updated last year
- A math-formula image recognition project that took first place in a competition hosted by NAVER CONNECT boostcamp AI Tech ☆10 · Dec 16, 2023 · Updated 2 years ago
- This is the official Pytorch code for our paper "Artemis: Structured Visual Reasoning for Perception Policy Learning". ☆14 · Dec 4, 2025 · Updated 2 months ago
- The official source code of our AAAI25 paper "D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matchin… ☆10 · Feb 9, 2025 · Updated last year
- Official Repository for "Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge" (CVPR 2024) ☆13 · Sep 1, 2024 · Updated last year
- ☆11 · Dec 8, 2024 · Updated last year
- Code for paper Audio Visual Speaker Localization from EgoCentric Views ☆11 · Jul 3, 2024 · Updated last year
- Keep track of internships for Summer 2020 for undergraduates interested in tech./SWE/related fields ☆10 · Feb 15, 2020 · Updated 5 years ago
- Official code release for "TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion", accepted ICIST 2023 ☆12 · Mar 17, 2024 · Updated last year
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition" ☆12 · Feb 27, 2024 · Updated last year
- This repository contains the code for our CVPR 2022 paper on "Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and … ☆42 · Nov 29, 2022 · Updated 3 years ago
- [ACL 2021] This is the Pytorch code for our paper "Semantic Relation-aware Difference Representation Learning for Change Captioning". ☆13 · Jan 16, 2022 · Updated 4 years ago
- ☆11 · May 7, 2022 · Updated 3 years ago
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023) ☆12 · Jun 1, 2023 · Updated 2 years ago
- Union-set Multi-source Model Adaptation for Semantic Segmentation ☆12 · Oct 24, 2022 · Updated 3 years ago
- The official repo of the paper "Cal-SFDA: Source-Free Domain-adaptive Semantic Segmentation with Differentiable Expected Calibration Erro… ☆10 · Oct 29, 2023 · Updated 2 years ago
- Official Repository for "Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality" (ECCV 2024) ☆15 · Oct 29, 2024 · Updated last year
- Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Finding]" ☆15 · Aug 27, 2025 · Updated 5 months ago
- ☆14 · Jul 1, 2024 · Updated last year
- [CVPR 2025] Official implementation of paper "Multi-Granularity Class Prototype Topology Distillation for Class-Incremental Source-Free … ☆17 · Aug 26, 2025 · Updated 5 months ago
- MAVERICS (Manually-vAlidated Vq^2a Examples fRom Image-Caption datasetS) is a suite of test-only benchmarks for visual question answering… ☆13 · Feb 18, 2023 · Updated 2 years ago
- Data-enriching GAN for retrieving Representative Samples from a Trained Classifier ☆14 · Sep 2, 2020 · Updated 5 years ago
- Code for "ATTA: Anomaly-aware Test-Time Adaptation for Out-of-Distribution Detection in Segmentation" (NeurIPS 23) ☆14 · Apr 12, 2024 · Updated last year
- Naver Boostcamp AI Tech Stage 3 : MRC (Machine Reading Comprehension) ☆10 · Jun 10, 2021 · Updated 4 years ago
- [CVPR 2023] Cascade Evidential Learning for Open-world Weakly-supervised Temporal Action Localization ☆12 · Jul 9, 2024 · Updated last year
- Official code for the paper "FairerCLIP: Debiasing CLIP’s Zero-Shot Predictions using Functions in RKHSs". ☆16 · Oct 14, 2025 · Updated 4 months ago
- ☆12 · Nov 3, 2024 · Updated last year
- ☆12 · Nov 29, 2023 · Updated 2 years ago
- Official Pytorch Implementation for "TextToucher: Fine-Grained Text-to-Touch Generation" (AAAI 2025) ☆18 · Jan 28, 2026 · Updated 2 weeks ago
- [NeurIPS 2024 Oral] Repository of the CMuST paper: "Get Rid of Isolation: A Continuous Multi-task Spatio-Temporal Learning Framework" ☆15 · Mar 12, 2025 · Updated 11 months ago