RevoSpeechTech / speech-datasets-collection

A curated list of speech datasets (110+ datasets, 75+ easy to download)

☆144

Alternatives and similar repositories for speech-datasets-collection

Users who are interested in speech-datasets-collection are comparing it to the repositories listed below.

SpeechColab / GigaSpeech2
An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement
☆165 Updated last month
k2-fsa / libriheavy
Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context
☆199 Updated 11 months ago
sarulab-speech / UTMOSv2
UTokyo-SaruLab MOS Prediction System
☆225 Updated 3 weeks ago
sarulab-speech / UTMOS22
UT-Sarulab MOS prediction system using SSL models
☆254 Updated last year
lifeiteng / naturalspeech3_facodec
FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3
☆213 Updated last year
Zain-Jiang / Speech-Editing-Toolkit
A repository of implementations of neural speech editing algorithms.
☆200 Updated last year
Takaaki-Saeki / DiscreteSpeechMetrics
Reference-aware automatic speech evaluation toolkit
☆160 Updated 8 months ago
mct10 / RepCodec
Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization
☆182 Updated last year
yanghaha0908 / FastHuBERT
Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning
☆94 Updated 9 months ago
tarepan / SpeechMOS
Easy-to-Use Speech MOS predictors
☆305 Updated last year
marianne-m / brouhaha-vad
Predicts the level of noise and reverberation in your audio files
☆156 Updated 2 months ago
voidful / Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark
☆270 Updated last month
LqNoob / Neural-Codec-and-Speech-Language-Models
Awesome Neural Codec Models, Text-to-Speech Synthesizers & Speech Language Models
☆178 Updated last week
imxtx / awesome-controllable-speech-synthesis
This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey".
☆152 Updated last week
yl4579 / PL-BERT
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
☆260 Updated 7 months ago
ga642381 / Speech-Prompts-Adapters
This repository surveys papers on prompting and adapters for speech processing.
☆112 Updated 2 years ago
imdatceleste / m-ailabs-dataset
This is the M-AILABS Speech Dataset
☆76 Updated 8 months ago
nii-yamagishilab / ZMM-TTS
ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations
☆172 Updated last year
hhguo / MSMC-TTS
Official implementation of Multi-Stage Multi-Codebook (MSMC) TTS
☆166 Updated last year
ankitapasad / layerwise-analysis
Layer-wise analysis of self-supervised pre-trained speech representations
☆114 Updated 10 months ago
xi-j / Mamba-ASR
ConMamba for Automatic Speech Recognition
☆81 Updated last year
lucasnewman / best-rq-pytorch
Implementation of BEST-RQ, a model for self-supervised learning of speech signals using a random projection quantizer, in PyTorch.
☆123 Updated last year
wenet-e2e / wesep
Target Speaker Extraction Toolkit
☆190 Updated 3 weeks ago
Mikxox / EnCodec_Trainer
☆61 Updated 2 years ago
0nutation / USLM
Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models" (ICLR 2024)
☆149 Updated last year
AndreevP / wvmos
MOS score prediction with a fine-tuned wav2vec 2.0 model
☆163 Updated 2 years ago
Audio-WestlakeU / FS-EEND
The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …
☆140 Updated 3 weeks ago
MontrealCorpusTools / mfa-models
Collection of pretrained models for the Montreal Forced Aligner
☆160 Updated 2 months ago
X-LANCE / UniCATS-CTX-vec2wav
[AAAI 2024] Code for CTX-vec2wav in UniCATS
☆129 Updated last year
mkunes / w2v2_audioFrameClassification
wav2vec2 audio classification for prosodic boundary detection and other tasks
☆41 Updated 2 years ago