pedro-morgado/spatialaudiogen

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pedro-morgado/spatialaudiogen)

pedro-morgado / spatialaudiogen

Spatial Audio Generation

☆117

Alternatives and similar repositories for spatialaudiogen

Users that are interested in spatialaudiogen are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

facebookresearch / 2.5D-Visual-Sound
View on GitHub
2.5D visual sound
☆121Jul 25, 2023Updated 3 years ago
jaeyeonkim99 / visage
View on GitHub
Official implementation of "ViSAGe: Video-to-Spatial AUdio Generation" (ICLR 2025)
☆47Sep 10, 2025Updated 10 months ago
SheldonTsui / SepStereo_ECCV2020
View on GitHub
Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)
☆72Oct 20, 2020Updated 5 years ago
pedro-morgado / AVSpatialAlignment
View on GitHub
☆31Jun 14, 2022Updated 4 years ago
facebookresearch / FAIR-Play
View on GitHub
2.5D visual sound dataset
☆108Sep 21, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
V-Sense / 360AudioVisual
View on GitHub
This repository contains materials for the paper: Towards generating ambisonics using audio-visual cue for virtual reality
☆13Jul 2, 2019Updated 7 years ago
SheldonTsui / PseudoBinaural_CVPR2021
View on GitHub
Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)
☆72Jul 8, 2021Updated 5 years ago
Audio-Experience-Design / LAPChallenge
View on GitHub
The LAP Challenge aims at advancing spatial audio technologies through the personalization of HRTFs.
☆16Aug 12, 2025Updated 11 months ago
sh01k / MeshRIR
View on GitHub
MeshRIR: Dataset of room impulse responses on meshed grid points
☆43Mar 13, 2026Updated 4 months ago
aluo-x / Learning_Neural_Acoustic_Fields
View on GitHub
Official code for "Learning Neural Acoustic Fields" (NeurIPS 2022)
☆167Jan 20, 2024Updated 2 years ago
facebookresearch / BinauralSpeechSynthesis
View on GitHub
N/A
☆190May 19, 2022Updated 4 years ago
facebookresearch / VisualEchoes
View on GitHub
VisualEchoes Dataset (ECCV 2020)
☆37Aug 31, 2021Updated 4 years ago
chris-hld / spaudiopy
View on GitHub
Spatial Audio Python Package
☆201Jun 28, 2026Updated 3 weeks ago
facebookresearch / A2B
View on GitHub
A2B Neural Rendering of Ambisonic Recordings to Binaural
☆20Aug 5, 2025Updated 11 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Benjamin-Tsui / HRTF_preprocessing
View on GitHub
HRTF data preparation for machine learning by finding common measurement angles
☆12May 14, 2019Updated 7 years ago
HS-YN / PanoAVQA
View on GitHub
Official repository of PanoAVQA: Grounded Audio-Visual Question Answering in 360° Videos (ICCV 2021)
☆16Oct 12, 2021Updated 4 years ago
MRSAudio / MRSAudio_Main
View on GitHub
MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations
☆43Oct 15, 2025Updated 9 months ago
sh01k / AmplitudeMatching
View on GitHub
A multizone sound field control method to synthesize a desired amplitude (or magnitude) distributions over a target region with multiple …
☆15Mar 30, 2023Updated 3 years ago
liangsusan-git / AV-NeRF
View on GitHub
[NeurIPS 2023] AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis
☆36Feb 15, 2024Updated 2 years ago
facebookresearch / learning-audio-visual-dereverberation
View on GitHub
Code for paper Learning Audio-Visual Dereverberation
☆32Aug 10, 2022Updated 3 years ago
YuriWayne42 / hrtf_sht_personalization
View on GitHub
the code for 'Global HRTF Personalization Using Anthropometric Measures'(AES 150th convention)
☆36Jul 24, 2022Updated 4 years ago
krantiparida / awesome-audio-visual
View on GitHub
A curated list of different papers and datasets in various areas of audio-visual processing
☆775Jan 30, 2024Updated 2 years ago
adobe-research / deep-acoustic-analysis
View on GitHub
☆26Jan 18, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
facebookresearch / sound-spaces
View on GitHub
A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple task…
☆468Sep 29, 2023Updated 2 years ago
lzhangbj / ASVA
View on GitHub
[ECCV 2024 Oral] Audio-Synchronized Visual Animation
☆60Mar 15, 2026Updated 4 months ago
cozcinar / 360_Audio_Visual_ICMEW2020
View on GitHub
Audio-Visual Perception of Omnidirectional Video for Virtual Reality Applications
☆15Feb 22, 2023Updated 3 years ago
rhgao / co-separation
View on GitHub
Co-Separating Sounds of Visual Objects (ICCV 2019)
☆98Jul 25, 2023Updated 3 years ago
ardasnck / learning_to_localize_sound_source
View on GitHub
Codebase and Dataset for the paper: Learning to Localize Sound Source in Visual Scenes
☆102Dec 4, 2024Updated last year
rohitrango / objects-that-sound
View on GitHub
Unofficial Implementation of Google Deepmind's paper `Objects that Sound`
☆83May 7, 2018Updated 8 years ago
arunbalajeev / binaural-sound-perception
View on GitHub
Semantic Object Prediction and Spatial Sound Super-Resolution with Binaural Sounds
☆20Dec 18, 2021Updated 4 years ago
ilpoviertola / V-AURA
View on GitHub
The official implementation of V-AURA: Temporally Aligned Audio for Video with Autoregression (ICASSP 2025) (Oral)
☆35Feb 11, 2026Updated 5 months ago
see2sound / see2sound
View on GitHub
Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound
☆141Mar 28, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
sony / audio-visual-seld-dcase2023
View on GitHub
Baseline method for audio-visual sound event localization and detection task of DCASE 2023 challenge
☆68Mar 19, 2025Updated last year
SAGNIKMJR / ego-AV-spatial-correspondence
View on GitHub
[CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'
☆14Jun 16, 2024Updated 2 years ago
XYPB / CondFoleyGen
View on GitHub
Official PyTorch implementation of "Conditional Generation of Audio from Video via Foley Analogies".
☆93Dec 8, 2023Updated 2 years ago
WikiChao / DAVIS
View on GitHub
[🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound …
☆33Mar 30, 2026Updated 3 months ago
karreny / telling-left-from-right
View on GitHub
Project website for "Telling left from right: Learning spatial correspondence between sight and sound"
☆29Jun 6, 2022Updated 4 years ago
stoneMo / SLAVC
View on GitHub
Official Codebase of "A Closer Look at Weakly-Supervised Audio-Visual Source Localization" (NeurIPS 2022)
☆22Dec 6, 2022Updated 3 years ago
marl / SpatialScaper
View on GitHub
☆75Aug 7, 2025Updated 11 months ago