OpenNLPLab/MMVAE-AVS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OpenNLPLab/MMVAE-AVS)

OpenNLPLab / MMVAE-AVS

Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023].

☆20

Alternatives and similar repositories for MMVAE-AVS

Users that are interested in MMVAE-AVS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lxa9867 / QSD
View on GitHub
[CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"
☆12Feb 27, 2024Updated 2 years ago
yannqi / COMBO-AVS
View on GitHub
[CVPR 2024 Highlight] Official implementation of the paper: Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-…
☆40Apr 20, 2025Updated last year
GeWu-Lab / Generalizable-Audio-Visual-Segmentation
View on GitHub
Official repository of "Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer", AAAI 2024
☆28Mar 14, 2026Updated 4 months ago
OpenNLPLab / FNAC_AVL
View on GitHub
[CVPR 2023] Official implementation of our paper - Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learnin…
☆29Apr 10, 2023Updated 3 years ago
hxixixh / mix-and-localize
View on GitHub
☆23Mar 20, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
zjsong / SSPL
View on GitHub
PyTorch code for "Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes" (CVPR, 2022…
☆32Jul 8, 2024Updated 2 years ago
fwt-team / DSVAE
View on GitHub
Deep Clustering Analysis via Dual Variational Autoencoder with Spherical Latent Embeddings powered by@torch
☆12Dec 12, 2021Updated 4 years ago
marmot-xy / CMBS
View on GitHub
cross modal background suppression for audio-visual event localization
☆36Mar 18, 2022Updated 4 years ago
SitongGong / Veason-R1
View on GitHub
Official code of Veason-R1
☆15Jul 14, 2026Updated last week
GeWu-Lab / Stepping-Stones
View on GitHub
The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024
☆18Oct 11, 2024Updated last year
justsmart / RecFormer
View on GitHub
Code for "Information Recovery-Driven Deep Incomplete Multi-view Clustering Network"
☆13Oct 9, 2024Updated last year
praneethmurthy / NORST
View on GitHub
MATLAB implementation of "Nearly Optimal Robust Subspace Tracking", ICML 2018. Longer version to appear in IEEE Journal of Selected Areas…
☆11May 27, 2020Updated 6 years ago
vvvb-github / AVSegFormer
View on GitHub
[AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer
☆74Mar 6, 2025Updated last year
GeWu-Lab / Ref-AVS
View on GitHub
The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024
☆50Oct 12, 2025Updated 9 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
stoneMo / AVGN
View on GitHub
Official implementation for AVGN
☆41Mar 24, 2023Updated 3 years ago
GenjiB / LAVISH
View on GitHub
Vision Transformers are Parameter-Efficient Audio-Visual Learners
☆107Aug 11, 2023Updated 2 years ago
pedro-morgado / AVSpatialAlignment
View on GitHub
☆31Jun 14, 2022Updated 4 years ago
XLearning-SCU / 2023-CVPR-FCMI
View on GitHub
☆21Jun 3, 2023Updated 3 years ago
YuliangXiu / TeCH
View on GitHub
[3DV 2024] Official repo of "TeCH: Text-guided Reconstruction of Lifelike Clothed Humans"
☆10May 15, 2024Updated 2 years ago
zexupan / avse_hybrid_loss
View on GitHub
☆16Jun 15, 2022Updated 4 years ago
Rudrabha / 8X-Super-Resolution
View on GitHub
This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…
☆16Aug 26, 2020Updated 5 years ago
cyh-0 / CAVP
View on GitHub
Official code for "A Closer Look at Audio-Visual Segmentation"
☆97Oct 31, 2025Updated 8 months ago
karreny / telling-left-from-right
View on GitHub
Project website for "Telling left from right: Learning spatial correspondence between sight and sound"
☆29Jun 6, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Overcautious / ADENet
View on GitHub
Accepted by TMM 2022
☆19Aug 18, 2022Updated 3 years ago
rjiang03 / HCL-OT
View on GitHub
☆11Apr 10, 2023Updated 3 years ago
HS-YN / PanoAVQA
View on GitHub
Official repository of PanoAVQA: Grounded Audio-Visual Question Answering in 360° Videos (ICCV 2021)
☆16Oct 12, 2021Updated 4 years ago
IFICL / stereocrw
View on GitHub
Code for the Paper: [ECCV2022] Sound Localization by Self-Supervised Time-Delay Estimation
☆28Mar 15, 2023Updated 3 years ago
GeWu-Lab / awesome-audiovisual-learning
View on GitHub
A curated list of audio-visual learning methods and datasets.
☆288Dec 3, 2024Updated last year
sony / audio-visual-seld-dcase2023
View on GitHub
Baseline method for audio-visual sound event localization and detection task of DCASE 2023 challenge
☆68Mar 19, 2025Updated last year
sunyuan-cs / 2024-TKDE-RMCNC
View on GitHub
About PyTorch implementation for ‘’Robust Multi-View Clustering with Noisy Correspondence‘’ (TKDE 2024)
☆11Aug 2, 2024Updated last year
Dshijie / DMVG
View on GitHub
Diffusion-based Missing-view Generation With the Application on Incomplete Multi-view Clustering
☆10May 26, 2024Updated 2 years ago
thomassutter / MoPoE
View on GitHub
Code release for ICLR 2021 paper "Generalised Multimodal ELBO"
☆42Nov 12, 2022Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
haoyi-duan / DG-SCT
View on GitHub
NeurIPS'2023 official implementation code
☆70Nov 11, 2023Updated 2 years ago
stoneMo / MGN
View on GitHub
Official implementation for MGN
☆20Dec 22, 2022Updated 3 years ago
DavideNardone / Blind-Source-Separation-using-Dictionary-Learning
View on GitHub
A model for Blind Source Separation using Dictionary Learning
☆13Sep 30, 2019Updated 6 years ago
cyh-0 / BoMD
View on GitHub
Official code for "BoMD: Bag of Multi-label Descriptors for Noisy Chest X-ray Classification"
☆26Apr 11, 2024Updated 2 years ago
soumith / rgbd_streamer
View on GitHub
☆15Feb 28, 2022Updated 4 years ago
ku-vai / TPoS
View on GitHub
This repository is for The Power of Sound(TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV2023)
☆25Dec 7, 2023Updated 2 years ago
WikiChao / Ego-AV-Loc
View on GitHub
[CVPR 2023] Egocentric Audio-Visual Object Localization
☆27Jan 6, 2024Updated 2 years ago