apple / ml-spatial-librispeech
A large synthetic dataset of spatial audio with multiple labels
☆92Updated last year
Related projects ⓘ
Alternatives and complementary repositories for ml-spatial-librispeech
- Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutiv…☆42Updated 8 months ago
- Translating Synthetic RIRs to Real RIRs☆40Updated last year
- Evaluation and Benchmarking of Speech Super-resolution Methods☆141Updated 2 years ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆72Updated 2 months ago
- Pytorch implementation of subband decomposition☆89Updated 2 years ago
- ☆40Updated 5 months ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆153Updated 2 years ago
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint☆58Updated last year
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆88Updated 3 months ago
- ☆104Updated last month
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆42Updated 2 months ago
- Expressive Anechoic Recordings of Speech (EARS)☆132Updated 4 months ago
- Bandwidth Extension of Historical Recordings using Generative Adversarial Networks☆34Updated last year
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆69Updated last month
- This is a curated list of awesome Speech Bandwidth Extension tutorials, papers, libraries, datasets, tools, scripts and results. The purp…☆62Updated 4 years ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆36Updated last month
- High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec☆80Updated last month
- Unsupervised Music Source Separation Using Differentiable Parametric Source Models☆60Updated last year
- ☆61Updated 7 months ago
- Fully Quantized Neural Networks For Speech Enhancement☆60Updated 9 months ago
- AQUA-Tk = Audio QUality Assessment-Toolkit. (In development)☆93Updated 2 weeks ago
- A library built for easier audio self-supervised training, downstream tasks evaluation☆106Updated 2 months ago
- ☆64Updated last year
- This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating r…☆152Updated 3 months ago
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆81Updated 8 months ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆57Updated last year
- Speech Dereverberation using Fully Convolutional Networks☆68Updated 4 years ago
- This code is to run the WARP-Q speech quality metric.☆34Updated last month
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆54Updated last year
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆110Updated last year