facebookresearch/ears_dataset

Expressive Anechoic Recordings of Speech (EARS)

☆221

Alternatives and similar repositories for ears_dataset

Users who are interested in ears_dataset are comparing it to the repositories listed below.

sp-uhh / ears_benchmark
Generation scripts for EARS-WHAM and EARS-Reverb
☆48 · Jul 4, 2025 · Updated last year
facebookresearch / FlowDec
A neural full-band audio codec for general audio sampled at 48 kHz with 7.5 kbps or 4.5 kbps.
☆212 · Jun 22, 2026 · Updated last month
line / LibriTTS-P
LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning
☆161 · Jun 13, 2024 · Updated 2 years ago
urgent-challenge / urgent2024_challenge
Official data preparation scripts for the URGENT 2024 Challenge
☆90 · May 21, 2025 · Updated last year
alessandroragano / scoreq
SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)
☆114 · Aug 1, 2025 · Updated 11 months ago
microsoft / SIG-Challenge
☆93 · Jun 9, 2024 · Updated 2 years ago
Ereboas / MagiCodec
A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.
☆125 · Jun 4, 2025 · Updated last year
bfs18 / rfwave
☆152 · Apr 25, 2025 · Updated last year
yluo42 / SRVQ
Spherical residual vector quantization (SRVQ)
☆31 · Aug 25, 2024 · Updated last year
sp-uhh / storm
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
☆255 · Sep 13, 2024 · Updated last year
Aria-K-Alethia / BigCodec
Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"
☆218 · Sep 19, 2024 · Updated last year
IDEA-Emdoor-Lab / DistilCodec
A Neural Audio Codec (NAC) for Universal Audio
☆46 · May 30, 2025 · Updated last year
XZWY / SpatialCodec
Implementation of SpatialCodec.
☆71 · Sep 23, 2023 · Updated 2 years ago
yxlu-0102 / MP-SENet
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
☆493 · May 19, 2025 · Updated last year
Andong-Li-speech / BridgeVoC
This is the repository for the work "BridgeVoC: Revitalizing Neural Vocoder from a Restoration Perspective".
☆67 · Nov 5, 2025 · Updated 8 months ago
tencent-ailab / FRA-RIR
☆214 · Dec 4, 2023 · Updated 2 years ago
yuguochencuc / BAE-Net
BAE-Net: A Low Complexity and High Fidelity Bandwidth-Adaptive Neural Network for Speech Super-Resolution
☆80 · Aug 20, 2024 · Updated last year
yukara-ikemiya / Open-Miipher-2
PyTorch implementation of Miipher-2 [2025], a speech restoration model by Google DeepMind
☆70 · Sep 22, 2025 · Updated 10 months ago
sp-uhh / sgmse
Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
☆764 · May 12, 2026 · Updated 2 months ago
BakerBunker / FreeV
[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter
☆98 · Jul 4, 2024 · Updated 2 years ago
winddori2002 / DEX-TTS
DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability
☆108 · Jan 17, 2025 · Updated last year
AI-S2-Lab / FluentEditor
[InterSpeech'2024] FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency
☆62 · Oct 23, 2024 · Updated last year
lifeiteng / NotebookTTS
Text-To-Speech for NotebookLM
☆39 · Jul 20, 2025 · Updated last year
facebookresearch / AudioDec
An Open-source Streaming High-fidelity Neural Audio Codec
☆510 · Mar 4, 2025 · Updated last year
line / open-universe
Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.
☆118 · Aug 29, 2024 · Updated last year
nanless / universal-speech-enhancement
Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…
☆82 · Jul 29, 2024 · Updated last year
voidful / Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark
☆308 · Jul 4, 2026 · Updated 2 weeks ago
csteinmetz1 / auraloss
Collection of audio-focused loss functions in PyTorch
☆874 · Jul 30, 2024 · Updated last year
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26 · Aug 5, 2024 · Updated last year
Audio-WestlakeU / RealMAN
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…
☆175 · Apr 29, 2025 · Updated last year
walker-hyf / NCSSD
Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)
☆61 · Nov 1, 2024 · Updated last year
RoyChao19477 / SEMamba
This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)
☆273 · Dec 12, 2025 · Updated 7 months ago
audiolabs / torch-pesq
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
☆228 · Jul 14, 2023 · Updated 3 years ago
Mddct / transformer-vocos
☆35 · Sep 6, 2025 · Updated 10 months ago
ssi-research / FQSE
Fully Quantized Neural Networks For Speech Enhancement
☆65 · Feb 15, 2024 · Updated 2 years ago
yangdongchao / SimpleSpeech
The open source code for SimpleSpeech series
☆147 · Oct 8, 2024 · Updated last year
ictnlp / SLED-TTS
Streamable Text-to-Speech model using a language modeling approach, without vector quantization
☆108 · May 20, 2025 · Updated last year
facebookresearch / audiobox-aesthetics
Unified automatic quality assessment for speech, music, and sound.
☆745 · Jun 5, 2025 · Updated last year
hhguo / SoCodec
Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications
☆92 · Dec 20, 2024 · Updated last year