VisualAIKHU/Missing-AVQA

Official Repository for "Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality" (ECCV 2024)

☆16

Alternatives and similar repositories for Missing-AVQA

Users who are interested in Missing-AVQA are comparing it to the repositories listed below.

VisualAIKHU / SIRA-SSL
Official Repository for "Audio-Visual Spatial Integration and Recursive Attention for Robust Sound Source Localization" (ACM MM 2023)
☆18 · Nov 14, 2023 · Updated 2 years ago
GeWu-Lab / PSTP-Net
☆17 · Aug 11, 2023 · Updated 2 years ago
jasongief / OV-AVEL
[2025 CVPR] Towards Open-Vocabulary Audio-Visual Event Localization
☆46 · Mar 7, 2025 · Updated last year
GeWu-Lab / BML_TPAMI2024
The repo for "On-the-fly Modulation for Balanced Multimodal Learning", T-PAMI 2024
☆19 · Sep 29, 2024 · Updated last year
zzhhfut / CCNet-AAAI2025
This repository contains code for AAAI2025 paper "Dense Audio-Visual Event Localization under Cross-Modal Consistency and Multi-Temporal …
☆24 · Aug 18, 2025 · Updated 11 months ago
genandlam / multi-modal-depression-detection
Official codebase for "Context Aware Deep Learning for Multi Modal Depression Detection" [ICASSP 2019, Oral]
☆11 · Dec 26, 2024 · Updated last year
sangminwoo / ActionMAE
[AAAI 2023 Oral] Official pytorch implementation of "Towards Good Practices for Missing Modality Robust Action Recognition"
☆23 · Dec 1, 2022 · Updated 3 years ago
ruohaoguo / avis
[CVPR 2025] 🔥 Official impl. of "Audio-Visual Instance Segmentation".
☆52 · Jun 5, 2025 · Updated last year
GeWu-Lab / InfoReg_CVPR2025
This is the repo for "Adaptive Unimodal Regulation for Balanced Multimodal Information Acquisition", CVPR2025.
☆24 · Dec 22, 2025 · Updated 7 months ago
GeWu-Lab / MUSIC-AVQA
MUSIC-AVQA, CVPR2022 (ORAL)
☆100 · Dec 30, 2022 · Updated 3 years ago
chengzju / CARAT
☆25 · Apr 16, 2025 · Updated last year
mira-ai-lab / MUSIC-AVQA-R
☆13 · May 21, 2024 · Updated 2 years ago
mdswyz / IMDer
An official implementation of "Incomplete Multimodality-Diffused Emotion Recognition" in PyTorch. (NeurIPS 2023)
☆64 · Dec 5, 2023 · Updated 2 years ago
billhhh / ShaSpec
The official code repository of ShaSpec model from CVPR 2023 [paper](https://arxiv.org/pdf/2307.14126) "Multi-modal Learning with Missing…
☆101 · Apr 16, 2025 · Updated last year
yuntaeyang / TelME
☆36 · Jul 25, 2024 · Updated 2 years ago
GeWu-Lab / TSPM
Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.
☆17 · Oct 25, 2024 · Updated last year
MinaJf / LMISA
A Lightweight Multi-modality Image Segmentation Network via Domain Adaptation using Gradient Magnitude and Shape Constraint
☆10 · Apr 3, 2023 · Updated 3 years ago
dingchaoyue / AcFormer
☆29 · Aug 2, 2023 · Updated 2 years ago
bowen-upenn / Multi-Agent-VQA
[CVPR 2024 CVinW] Multi-Agent VQA: Exploring Multi-Agent Foundation Models on Zero-Shot Visual Question Answering
☆22 · Sep 21, 2024 · Updated last year
xxayt / MGSV
[ICCV 2025] This repo is the official implementation of "Music Grounding by Short Video"
☆27 · Sep 9, 2025 · Updated 10 months ago
Chunmian-art / City-3DQA
☆23 · Apr 19, 2024 · Updated 2 years ago
HeranYang / hyper-GAE
The official Tensorflow implementation of the paper "Learning Unified Hyper-network for Multi-modal MR Image Synthesis and Tumor Segmenta…
☆14 · Oct 5, 2023 · Updated 2 years ago
ZhuoYulang / CIF-MMIN
☆41 · Apr 16, 2024 · Updated 2 years ago
katerynaCh / MMA-DFER
This repository provides the codes for MMA-DFER: multimodal (audiovisual) emotion recognition method. This is an official implementation …
☆57 · Sep 16, 2024 · Updated last year
kylehkhsu / tripod
☆12 · Apr 19, 2024 · Updated 2 years ago
yyyanbj / FedCM
[IJCNN 2021] FedCM: A Real-time Contribution Measurement Method for Participants in Federated Learning
☆11 · Aug 21, 2021 · Updated 4 years ago
Burf / SwinTransformer-Tensorflow2
SwinTransformer for Tensorflow2
☆11 · Jul 7, 2022 · Updated 4 years ago
MediaBrain-SJTU / GPFL-GRACE
[MICCAI 2023] GRACE: Enhancing Federated Learning for Medical Imaging with Generalized and Personalized Gradient Correction
☆17 · Jun 29, 2023 · Updated 3 years ago
yoxu515 / VIPOSeg-Benchmark
The benchmark for "Video Object Segmentation in Panoptic Wild Scenes".
☆12 · Oct 17, 2023 · Updated 2 years ago
murufeng / knowledge_distillation
A plug-and-play knowledge distillation toolkit
☆13 · May 16, 2022 · Updated 4 years ago
idansc / simple-avsd
Code for "A Simple Baseline for Audio-Visual Scene-Aware Dialog"
☆27 · May 26, 2020 · Updated 6 years ago
Zhang-VISLab / NeurIPS2023-InfoCD
The official repository of the paper "InfoCD: A Contrastive Chamfer Distance Loss for Point Cloud Completion" published at NeurIPS 2023
☆23 · Oct 13, 2023 · Updated 2 years ago
AIM-SKKU / RA-Touch
RA-Touch: Retrieval-Augmented Touch Understanding with Enriched Visual Data (ACM MM '25)
☆15 · Sep 12, 2025 · Updated 10 months ago
JHome1 / GiO-GiT
☆18 · Sep 29, 2025 · Updated 9 months ago
SpeechEE / SpeechEE
☆11 · Aug 20, 2025 · Updated 11 months ago
SSyangguang / MEF-freq
Code for A Dual Domain Multi-exposure Image Fusion Network Based on the Spatial-frequency Integration.
☆12 · Jul 25, 2024 · Updated 2 years ago
Wanderlust717 / CARGNet
[TGRS 2023] Point Label Meets Remote Sensing Change Detection: A Consistency-Aligned Regional Growth Network
☆15 · Jan 5, 2024 · Updated 2 years ago
deepsuperviser / CTFN
This is the code for Coupled-translation Fusion Network.
☆11 · Dec 2, 2021 · Updated 4 years ago
lartpang / UltraHighResolution
Papers about the ultra high resolution tasks.
☆13 · Jul 12, 2024 · Updated 2 years ago