jcsilva / multilingual-g2p
Multilingual Grapheme to Phoneme
☆49Updated 9 years ago
Alternatives and similar repositories for multilingual-g2p:
Users that are interested in multilingual-g2p are comparing it to the libraries listed below
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated 2 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 4 years ago
- multilingual speech aligner☆72Updated last year
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆41Updated 2 years ago
- scripts to align a given wave to its transcription using trained models by Kaldi☆32Updated 5 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆101Updated last year
- CMU multilingual speech repository☆31Updated 2 years ago
- A phoneme-allophone database for many languages☆50Updated 4 years ago
- Alignment files of LibriTTS.☆61Updated 5 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆57Updated 2 years ago
- ☆35Updated last week
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- ☆51Updated 6 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- ☆40Updated 3 years ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆38Updated 2 years ago
- Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021☆39Updated 3 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆34Updated 5 years ago
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.☆44Updated 5 years ago
- Integration of Fastspeech Text to Mel generation and fast Vocoder Squeezewave☆20Updated last year
- MelGAN implementation with Multi-Band and Full Band supports...☆61Updated 4 years ago
- ☆58Updated 5 years ago
- asr2k☆49Updated 9 months ago
- Tacotron2 with Global Style Tokens☆64Updated 5 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆67Updated 6 months ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆40Updated last year
- VAE Tacotron 2, an alternative of GST Tacotron☆88Updated last year
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆54Updated 5 years ago
- Text frontend for ESPnet tts recipes☆31Updated 3 years ago
- RawNet: Fast End-to-End Neural Vocoder☆41Updated 5 years ago