apple/ml-spatial-librispeech

apple / ml-spatial-librispeech

A large synthetic dataset of spatial audio with multiple labels

☆127

Alternatives and similar repositories for ml-spatial-librispeech

Users who are interested in ml-spatial-librispeech are comparing it to the repositories listed below.

Graphi07 / room-impulse-responses
A list of publicly available room impulse response datasets and scripts to download them.
☆594 · May 11, 2026 · Updated 2 months ago
yongyizang / GSound-SIR
A Python Room Spatial Impulse Response Ray-Tracing Toolkit
☆86 · Mar 4, 2026 · Updated 4 months ago
tencent-ailab / FRA-RIR
☆214 · Dec 4, 2023 · Updated 2 years ago
popcornell / SparseLibriMix
☆73 · Feb 15, 2021 · Updated 5 years ago
WangHelin1997 / LibriLightMix-WHAMR
Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM
☆17 · Nov 7, 2024 · Updated last year
audiolabs / torch-pesq
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
☆228 · Jul 14, 2023 · Updated 3 years ago
Audio-Experience-Design / LAPChallenge
The LAP Challenge aims at advancing spatial audio technologies through the personalization of HRTFs.
☆16 · Aug 12, 2025 · Updated 11 months ago
MRSAudio / MRSAudio_Main
MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations
☆43 · Oct 15, 2025 · Updated 9 months ago
merlresearch / neural-IIR-field
Neural IIR Filter Field for HRTF Upsampling and Personalization
☆29 · Feb 26, 2024 · Updated 2 years ago
partha2409 / DCASE2024_seld_baseline
☆52 · Dec 13, 2025 · Updated 7 months ago
choiHkk / Transformer-TTS-V2
☆25 · Mar 6, 2024 · Updated 2 years ago
Audio-WestlakeU / NBSS
The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
☆362 · Jan 1, 2025 · Updated last year
marl / SpatialScaper
☆75 · Aug 7, 2025 · Updated 11 months ago
GAMMA-UMD / pygsound
Impulse response generation based on state-of-the-art geometric sound propagation engine.
☆177 · Jan 17, 2023 · Updated 3 years ago
reppy4620 / convnext_tts
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18 · Oct 20, 2024 · Updated last year
Audio-AGI / dcase2024_task9_baseline
Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"
☆26 · Mar 27, 2024 · Updated 2 years ago
anton-jeran / MULTI-AUDIODEC
This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.
☆54 · Mar 17, 2025 · Updated last year
JusperLee / Gull-Codec-Training
☆12 · Mar 11, 2025 · Updated last year
donghoney0416 / DeepASA
Official page of "DeepASA: An Object-Oriented Multi-Purpose Network for Auditory Scene Analysis"
☆26 · Apr 15, 2026 · Updated 3 months ago
danielkrause / DCASE2022-data-generator
Data generator for creating synthetic audio mixtures suitable for DCASE Challenge 2022 Task 3
☆47 · Apr 5, 2023 · Updated 3 years ago
patrickvonplaten / audio-gen-dreambooth
☆23 · Jun 13, 2023 · Updated 3 years ago
IoSR-Surrey / RealRoomBRIRs
Binaural impulse responses captured in real rooms.
☆41 · Mar 9, 2016 · Updated 10 years ago
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26 · Aug 5, 2024 · Updated last year
kamo-naoyuki / pytorch_complex
A temporal module for PyTorch-ComplexTensor
☆44 · Jun 28, 2024 · Updated 2 years ago
line / open-universe
Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.
☆118 · Aug 29, 2024 · Updated last year
audiolabs / anechoic-noise
Generator for anechoic, non-stationary noise signals
☆12 · Aug 12, 2022 · Updated 3 years ago
apple / ml-interspeech2022-phi_rtn
Repository accompanying the Interspeech 2022 publication titled "Space-Efficient Representation of Entity-centric Query Language Models" …
☆13 · Sep 8, 2022 · Updated 3 years ago
Sreyan88 / ReCLAP
☆33 · Dec 23, 2025 · Updated 6 months ago
yzyouzhang / Audio_Research_in_US
Audio Research in US. US-based professors who work on audio (music, speech, acoustics). For students who would like to apply for RA, PhD,…
☆27 · Feb 27, 2026 · Updated 4 months ago
sp-uhh / buddy
BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models
☆66 · Oct 18, 2024 · Updated last year
qiuqiangkong / audioflow
☆128 · Updated this week
facebookresearch / ears_dataset
Expressive Anechoic Recordings of Speech (EARS)
☆221 · Jun 25, 2024 · Updated 2 years ago
JusperLee / SonicSim
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios
☆277 · Jan 22, 2025 · Updated last year
FrancoisGrondin / BIRD
Big Impulse Response Dataset
☆159 · Oct 19, 2022 · Updated 3 years ago
sarulab-speech / SpatialCLAP
☆19 · Oct 9, 2025 · Updated 9 months ago
WangHelin1997 / SoloAudio
SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.
☆119 · Jan 28, 2026 · Updated 5 months ago
cpii-cai / PunCantonese
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆15 · Dec 3, 2024 · Updated last year
yluo42 / TAC
transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.
☆308 · Jun 15, 2021 · Updated 5 years ago
whojavumusic / HARP
HARP: A Large-Scale Higher-Order Ambisonic Room Impulse Response Dataset
☆35 · Jun 3, 2025 · Updated last year