facebookresearch/visual-acoustic-matching

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/visual-acoustic-matching)

facebookresearch / visual-acoustic-matching

Repo for Visual Acoustic Matching, CVPR 2022

☆71

Alternatives and similar repositories for visual-acoustic-matching

Users that are interested in visual-acoustic-matching are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

facebookresearch / learning-audio-visual-dereverberation
View on GitHub
Code for paper Learning Audio-Visual Dereverberation
☆32Aug 10, 2022Updated 3 years ago
facebookresearch / novel-view-acoustic-synthesis
View on GitHub
Code for Novel View Acoustic Synthesis paper
☆54Aug 14, 2023Updated 2 years ago
SAGNIKMJR / move2hear-active-AV-separation
View on GitHub
Code and datasets for 'Move2Hear: Active Audio-Visual Source Separation' (ICCV 2021)
☆16Jun 17, 2026Updated last month
facebookresearch / VisualEchoes
View on GitHub
VisualEchoes Dataset (ECCV 2020)
☆37Aug 31, 2021Updated 4 years ago
anton-jeran / AV-RIR
View on GitHub
Audio-Visual Room Impulse Response Estimation
☆25Jul 22, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Chutlhu / dEchorate
View on GitHub
Da - ECHO - RetrievAl - daTasEt
☆36Jul 7, 2024Updated 2 years ago
SheldonTsui / PseudoBinaural_CVPR2021
View on GitHub
Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)
☆72Jul 8, 2021Updated 5 years ago
uark-cviu / Right2Talk
View on GitHub
[ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach
☆20Aug 2, 2021Updated 4 years ago
LIMUNIMI / Real-time-SDN
View on GitHub
Scattering delay networks plugin in Juce
☆34Feb 24, 2026Updated 5 months ago
Curly-Mo / crosstalk_cancellation
View on GitHub
Apply crosstalk cancellation to a binaural audio file
☆15Jul 7, 2016Updated 10 years ago
facebookresearch / 2.5D-Visual-Sound
View on GitHub
2.5D visual sound
☆121Jul 25, 2023Updated 3 years ago
anton-jeran / FAST-RIR
View on GitHub
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating r…
☆182Mar 19, 2026Updated 4 months ago
facebookresearch / replay_dataset
View on GitHub
Download scripts and tools for Replay dataset.
☆39Jun 23, 2023Updated 3 years ago
anton-jeran / MESH2IR
View on GitHub
This is the official implementation of our mesh-based neural network (MESH2IR) to generate acoustic impulse responses (IRs) for indoor 3D…
☆110Jul 24, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Franklin905 / VALOR
View on GitHub
Research code for NeurIPS 2023 paper "Modality-Independent Teachers Meet Weakly-Supervised Audio-Visual Event Parser"
☆17Jul 13, 2025Updated last year
stoneMo / OneAVM
View on GitHub
Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)
☆12Jun 1, 2023Updated 3 years ago
RuttenStijn / Thesis
View on GitHub
Code and extra figures as part of the thesis about Relative transfer function estimation for multi-microphone speech enhancement based on…
☆11Jan 10, 2018Updated 8 years ago
apple / ml-nvas3d
View on GitHub
☆49Jul 20, 2024Updated 2 years ago
AmandineBtto / Batvision-Dataset
View on GitHub
A large-scale real-world audio-visual dataset for research on 3D scene understanding and echolocation.
☆22Oct 21, 2025Updated 9 months ago
SheldonTsui / SepStereo_ECCV2020
View on GitHub
Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)
☆72Oct 20, 2020Updated 5 years ago
aispeech-lab / advr-avss
View on GitHub
Pytorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.
☆18Jul 11, 2022Updated 4 years ago
yyf17 / SAAVN
View on GitHub
SAAVN Code release for paper "Sound Adversarial Audio-Visual Navigation,ICLR2022" (In PyTorch)
☆21Nov 9, 2022Updated 3 years ago
facebookresearch / sound-spaces
View on GitHub
A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple task…
☆468Sep 29, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ku-vai / TPoS
View on GitHub
This repository is for The Power of Sound(TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV2023)
☆25Dec 7, 2023Updated 2 years ago
krantiparida / beyond-image-to-depth
View on GitHub
☆38Jun 29, 2021Updated 5 years ago
SAGNIKMJR / few-shot-rir
View on GitHub
Code and datasets for 'Few-Shot Audio-Visual Learning of Environment Acoustics' (NeurIPS 2022)
☆24Jun 16, 2026Updated last month
facebookresearch / soundspaces-challenge
View on GitHub
Starter code for SoundSpaces challenge at CVPR 21's Embodied AI workshop
☆16Mar 2, 2023Updated 3 years ago
sony / creativeai
View on GitHub
☆79Jul 7, 2026Updated 3 weeks ago
GAMMA-UMD / Fast3DScattering-release
View on GitHub
Repo for our research paper "Learning Acoustic Scattering Fields for Dynamic Interactive Sound Propagation"
☆17Apr 6, 2021Updated 5 years ago
nikhilsinghmus / image2reverb
View on GitHub
[ICCV 2021] Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis.
☆91Oct 12, 2021Updated 4 years ago
facebookresearch / 6DoF-Auraliser
View on GitHub
An auralisation system that takes a head-worn microphone array recordings as input and renders the audio for binaural playback; taking in…
☆37Oct 10, 2023Updated 2 years ago
pierreguillot / vbap
View on GitHub
Vector Base Amplitude Panning
☆21Oct 21, 2017Updated 8 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
lsg1213 / PEAQ_python
View on GitHub
Python version of PEAQ(Perceptual Evaluation of Audio Quality)
☆14Jul 24, 2025Updated last year
zaocan666 / DyViSE
View on GitHub
Dynamic vision-guided speaker embedding for audio-visual speaker diarization
☆12Jul 5, 2022Updated 4 years ago
Audio-WestlakeU / VINP
View on GitHub
Official PyTorch implementation of 'VINP: Variational Bayesian Inference with Neural Speech Prior for Joint ASR-Effective Speech Dereverb…
☆36Feb 23, 2026Updated 5 months ago
IFICL / SLfM
View on GitHub
Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation
☆43Jul 16, 2026Updated last week
EvelynZhou / FAST-RIR
View on GitHub
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating r…
☆12Nov 30, 2021Updated 4 years ago
zengchang233 / xiaoicesing2
View on GitHub
The source code for the paper XiaoiceSing2 (interspeech2023)
☆49Jan 15, 2024Updated 2 years ago
jonashaag / RealRIRs
View on GitHub
Python loaders for many Real Room Impulse Response databases
☆97Sep 30, 2024Updated last year