facebookresearch / sound-spaces
A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple tasks and applications.
β362Updated last year
Related projects β
Alternatives and complementary repositories for sound-spaces
- Official code for "Learning Neural Acoustic Fields" (NeurIPS 2022)β130Updated 10 months ago
- π Repository for our NAACL-HLT 2019 paper: AudioCapsβ144Updated 6 months ago
- Code and datasets for 'Move2Hear: Active Audio-Visual Source Separation' (ICCV 2021)β14Updated last year
- 2.5D visual soundβ110Updated last year
- The Easy Communications (EasyCom) dataset is a world-first dataset designed to help mitigate the *cocktail party effect* from an augmenteβ¦β106Updated 11 months ago
- β44Updated 4 months ago
- Repo for Visual Acoustic Matching, CVPR 2022β65Updated last year
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.β349Updated 3 years ago
- Spatial Audio Generationβ100Updated last year
- This is the official implementation of our mesh-based neural network (MESH2IR) to generate acoustic impulse responses (IRs) for indoor 3Dβ¦β82Updated 3 months ago
- VisualEchoes Dataset (ECCV 2020)β34Updated 3 years ago
- Co-Separating Sounds of Visual Objects (ICCV 2019)β94Updated last year
- N/Aβ165Updated 2 years ago
- Python library for downloading, loading & working with sound datasetsβ325Updated last month
- A lightweight library for Frechet Audio Distance calculation.β237Updated 2 months ago
- This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating rβ¦β152Updated 3 months ago
- Impulse response generation based on state-of-the-art geometric sound propagation engine.β148Updated last year
- Scripts for download AudioSetβ68Updated 7 years ago
- The Cone of Silence:β151Updated 2 years ago
- Code for the Paper: [ECCV2022] Sound Localization by Self-Supervised Time-Delay Estimationβ19Updated last year
- Audio Dataset for training CLAP and other modelsβ637Updated 9 months ago
- An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Sβ¦β390Updated last year
- Toolkit for downloading and processing Google's AudioSet dataset.β162Updated last year
- PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]β263Updated 5 years ago
- Audio Captioning datasets for PyTorch.β107Updated 2 weeks ago
- Official implementation of the pipeline presented in I hear your true colors: Image Guided Audio Generationβ104Updated last year
- Implementation of DDSP (PyTorch), Differentiable Digital Signal Processing (ICLR 2020)β152Updated 3 years ago
- Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".β386Updated 6 months ago
- β23Updated 4 years ago
- This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.β205Updated 3 months ago