apple / ml-spatial-librispeechLinks
A large synthetic dataset of spatial audio with multiple labels
☆116Updated last year
Alternatives and similar repositories for ml-spatial-librispeech
Users that are interested in ml-spatial-librispeech are comparing it to the libraries listed below
Sorting:
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆106Updated last year
- Expressive Anechoic Recordings of Speech (EARS)☆192Updated last year
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆70Updated 2 years ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆157Updated 3 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆152Updated 3 years ago
- An implementation of audio source separation tools.☆83Updated 2 years ago
- Single channel speech source separation by diffusion process (ICASSP 2023)☆115Updated last year
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆94Updated last year
- Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutiv…☆46Updated 7 months ago
- Unofficial PyTorch implementation of "SCNet: Sparse Compression Network for Music Source Separation"☆57Updated last year
- Translating Synthetic RIRs to Real RIRs☆43Updated 2 years ago
- BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models☆57Updated 11 months ago
- This is the official implementation of reverberant speech to room impulse response estimator☆38Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆50Updated 6 months ago
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆90Updated 2 months ago
- AQUA-Tk = Audio QUality Assessment-Toolkit. (In development)☆100Updated 11 months ago
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint☆74Updated 2 years ago
- ☆106Updated last month
- Pytorch implementation of subband decomposition☆92Updated 3 years ago
- This code is to run the WARP-Q speech quality metric.☆35Updated 11 months ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆99Updated last year
- Transformer with Local Modeling by Convolution for Speech Separation and Enhancement☆96Updated 2 months ago
- This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating r…☆170Updated last year
- A DDSP-based neural voice synthesiser.☆119Updated 10 months ago
- Bandwidth Extension of Historical Recordings using Generative Adversarial Networks☆36Updated 2 years ago
- PAM is a no-reference audio quality metric for audio generation tasks☆74Updated last year
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆38Updated 4 months ago
- This is a curated list of awesome Speech Bandwidth Extension tutorials, papers, libraries, datasets, tools, scripts and results. The purp…☆68Updated 5 years ago
- ☆65Updated 2 years ago
- Implementation of FiNS model for RIR estimation☆34Updated last year