facebookresearch / libri-light
dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.
☆492Updated last year
Alternatives and similar repositories for libri-light:
Users that are interested in libri-light are comparing it to the libraries listed below
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation☆528Updated 2 years ago
- Library for Textless Spoken Language Processing☆537Updated last year
- Large, modern dataset for speech recognition☆670Updated last year
- A library for speech data augmentation in time-domain☆655Updated 3 years ago
- CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)☆373Updated 3 years ago
- ESPnet Model Zoo☆248Updated last year
- UniSpeech - Large Scale Self-Supervised Learning for Speech☆455Updated last year
- VCTK multi-speaker tacotron for ICASSP 2020☆266Updated 3 years ago
- Word alignments generated by the Montreal Forced Aligner for the Librispeech dataset☆161Updated 6 years ago
- Segment an audio file and obtain utterance alignments. (Python package)☆334Updated 10 months ago
- VQ-VAE for Acoustic Unit Discovery and Voice Conversion☆334Updated last year
- A list of publically available audio data that anyone can download for ASR or other speech activities☆206Updated 3 years ago
- Tools for handling speech data in machine learning projects.☆1,005Updated last week
- g2p: English Grapheme To Phoneme Conversion☆848Updated 2 years ago
- A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf☆369Updated 3 years ago
- A Generative Flow for Text-to-Speech via Monotonic Alignment Search☆686Updated 2 years ago
- A fast and lightweight python-based CTC beam search decoder for speech recognition.☆439Updated last year
- Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.☆402Updated 3 years ago
- Multilingual G2P in 100 languages☆319Updated last year
- Unsupervised Speech Decomposition Via Triple Information Bottleneck☆671Updated 5 months ago
- An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S…☆404Updated last year
- Grapheme to phoneme conversion with deep learning.☆381Updated last year
- see README☆340Updated 8 months ago
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration…☆325Updated 2 years ago
- Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.☆147Updated 2 years ago
- A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis☆366Updated 2 years ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech☆230Updated 2 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆191Updated 3 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆239Updated 4 years ago
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"☆359Updated 8 months ago