mira-ai-lab/MUSIC-AVQA-R

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mira-ai-lab/MUSIC-AVQA-R)

mira-ai-lab / MUSIC-AVQA-R

☆13

Alternatives and similar repositories for MUSIC-AVQA-R

Users that are interested in MUSIC-AVQA-R are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

GeWu-Lab / TSPM
View on GitHub
Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.
☆17Oct 25, 2024Updated last year
GeWu-Lab / MUSIC-AVQA
View on GitHub
MUSIC-AVQA, CVPR2022 (ORAL)
☆100Dec 30, 2022Updated 3 years ago
AlyssaYoung / AVQA
View on GitHub
ACM MM 2022 paper_AVQA: A Dataset for Audio-Visual Question Answering on Videos
☆15Aug 17, 2023Updated 2 years ago
SaeedGooda / Top-Tech-Companies-Interview-Problems
View on GitHub
☆12Nov 3, 2024Updated last year
mido3ds / egypt-summer2020internships
View on GitHub
Keep track of internships for Summer 2020 for undergraduates interested in tech./SWE/related fields
☆11Feb 15, 2020Updated 6 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
zzhhfut / CCNet-AAAI2025
View on GitHub
This repository contains code for AAAI2025 paper "Dense Audio-Visual Event Localization under Cross-Modal Consistency and Multi-Temporal …
☆24Aug 18, 2025Updated 11 months ago
stoneMo / CIGN
View on GitHub
Official implementation for CIGN
☆17Sep 11, 2023Updated 2 years ago
mira-ai-lab / DoG
View on GitHub
☆25Apr 15, 2025Updated last year
bmcfee / ccrma2018_notebooks
View on GitHub
Extra notebooks for CCRMA MIR workshop, 2018 edition
☆13Jun 28, 2018Updated 8 years ago
jasongief / OV-AVEL
View on GitHub
[2025 CVPR] Towards Open-Vocabulary Audio-Visual Event Localization
☆46Mar 7, 2025Updated last year
kaiw7 / STG-CMA
View on GitHub
Towards Efficient Audio-Visual Learners via Empowering Pre-trained Vision Transformers with Cross-Modal Adaptation
☆15Apr 13, 2024Updated 2 years ago
GeWu-Lab / PSTP-Net
View on GitHub
☆17Aug 11, 2023Updated 2 years ago
lzy7976 / union-set-model-adaptation
View on GitHub
Union-set Multi-source Model Adaptation for Semantic Segmentation
☆12Oct 24, 2022Updated 3 years ago
ml-postech / selective-generation
View on GitHub
☆11Dec 8, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
swarupbehera / awesome-audio-visual-question-answering
View on GitHub
A curated list of resources in audio visual question answering and related area. :-)
☆17Jun 29, 2025Updated last year
epic-kitchens / VISOR-VIS
View on GitHub
Visualisation of VISOR Segmentations with Annotations and Relations
☆22Aug 15, 2022Updated 3 years ago
GeWu-Lab / BML_TPAMI2024
View on GitHub
The repo for "On-the-fly Modulation for Balanced Multimodal Learning", T-PAMI 2024
☆19Sep 29, 2024Updated last year
AIM-SKKU / QA-TIGER
View on GitHub
Question-Aware Gaussian Experts for Audio-Visual Question Answering -- Official Pytorch Implementation (CVPR'25, Highlight)
☆29Jun 6, 2025Updated last year
md-mohaiminul / BIMBA
View on GitHub
☆29Jul 25, 2025Updated last year
Chunmian-art / City-3DQA
View on GitHub
☆23Apr 19, 2024Updated 2 years ago
VisualAIKHU / Missing-AVQA
View on GitHub
Official Repository for "Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality" (ECCV 2024)
☆16Oct 29, 2024Updated last year
DILab-USTCSZ / CMuST
View on GitHub
[NeurIPS 2024 Oral] Repository of the CMuST paper: "Get Rid of Isolation: A Continuous Multi-task Spatio-Temporal Learning Framework"
☆15Mar 12, 2025Updated last year
FengHZ / CoSDA
View on GitHub
The official implementation of our work CoSDA: Continual Source-Free Domain Adaptation.
☆45Feb 24, 2026Updated 5 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
llm-conditioned-diffusion / OmniDiffusion
View on GitHub
☆14Jul 17, 2024Updated 2 years ago
ExplainableML / AVCA-GZSL
View on GitHub
This repository contains the code for our CVPR 2022 paper on "Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and …
☆43Nov 29, 2022Updated 3 years ago
Zhang-VISLab / NeurIPS2023-InfoCD
View on GitHub
The official repository of the paper "InfoCD: A Contrastive Chamfer Distance Loss for Point Cloud Completion" published at NeurIPS 2023
☆23Oct 13, 2023Updated 2 years ago
facebookresearch / daqa
View on GitHub
Temporal Reasoning via Audio Question Answering
☆27Dec 21, 2019Updated 6 years ago
vscomputer / chuck-examples
View on GitHub
Example code to help people follow along with the tutorials
☆25Aug 21, 2024Updated last year
JHome1 / GiO-GiT
View on GitHub
☆18Sep 29, 2025Updated 9 months ago
tuyunbin / Review-of-Change-Captioning
View on GitHub
This repository offers a comprehensive overview of existing datasets and methods in the field of change captioning.
☆17Sep 2, 2025Updated 10 months ago
zinuoli / TriSense
View on GitHub
[NeurIPS 2025] Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM
☆27Feb 10, 2026Updated 5 months ago
adxcreative / D-M
View on GitHub
The official source code of our AAAI25 paper "D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matchin…
☆10Feb 9, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
tmallet / expo-proximity
View on GitHub
Provides access to the system's proximity sensor.
☆26Jan 1, 2025Updated last year
human-analysis / FairerCLIP
View on GitHub
Official code for the paper "FairerCLIP: Debiasing CLIP’s Zero-Shot Predictions using Functions in RKHSs".
☆16Oct 14, 2025Updated 9 months ago
forwchen / mfcc_boaw
View on GitHub
Extract MFCCs from videos and make bag-of-audio-words (BOAW) representations.
☆11Dec 20, 2018Updated 7 years ago
mira-ai-lab / Deliberation-on-Priors
View on GitHub
☆39May 22, 2025Updated last year
Mr-Neko / JM3D
View on GitHub
The offical implemention of JM3D.
☆31Apr 8, 2026Updated 3 months ago
tuyunbin / SRDRL
View on GitHub
[ACL 2021] This is the Pytorch code for our paper "Semantic Relation-aware Difference Representation Learning for Change Captioning".
☆13Jan 16, 2022Updated 4 years ago
beasteers / singuconda
View on GitHub
go binary for setting up singularity containers with a miniconda
☆21Feb 3, 2026Updated 5 months ago