apple / ml-spatial-librispeech
A large synthetic dataset of spatial audio with multiple labels
☆103Updated last year
Alternatives and similar repositories for ml-spatial-librispeech:
Users that are interested in ml-spatial-librispeech are comparing it to the libraries listed below
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆91Updated 7 months ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆64Updated last year
- Translating Synthetic RIRs to Real RIRs☆41Updated last year
- Transformer with Local Modeling by Convolution for Speech Separation and Enhancement☆51Updated 7 months ago
- Pytorch implementation of subband decomposition☆92Updated 2 years ago
- PAM is a no-reference audio quality metric for audio generation tasks☆57Updated 8 months ago
- BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models☆43Updated 5 months ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆155Updated 2 years ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆47Updated 2 weeks ago
- A simple package for Guided source separation (GSS)☆118Updated 10 months ago
- This is the official implementation of the LiSenNet☆68Updated 4 months ago
- ☆64Updated last year
- ☆189Updated last year
- Blind source separation with independent vector analysis family of algorithm in torch☆97Updated 2 years ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆95Updated 8 months ago
- Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutiv…☆43Updated 3 weeks ago
- Python loaders for many Real Room Impulse Response databases☆88Updated 6 months ago
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint☆67Updated last year
- Official data preparation scripts for the URGENT 2024 Challenge☆77Updated 2 months ago
- ☆78Updated 9 months ago
- Single channel speech source separation by diffusion process (ICASSP 2023)☆101Updated last year
- A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…☆116Updated 3 months ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆49Updated 5 months ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆53Updated last year
- Fully Quantized Neural Networks For Speech Enhancement☆61Updated last year
- Evaluation and Benchmarking of Speech Super-resolution Methods☆149Updated 2 years ago
- HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement (ICASSP 2023)☆79Updated last year
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆95Updated last year
- Augmenting Room Impulse Response☆41Updated last year
- High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec☆96Updated 2 months ago