qqueing / pytorch-G2P
(semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean
☆23Updated 6 years ago
Related projects:
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆34Updated 3 years ago
- PyTorch based speaker embedding model☆15Updated 5 months ago
- ABX and kaldi experiments on speech corpora made easy☆31Updated last year
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 3 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 3 years ago
- A package for crawling audio from YouTube☆41Updated last year
- readers that enable reading kaldi ark in tensorflow☆17Updated 6 years ago
- Pytorch implementation of Deepmind's WaveRNN model☆13Updated 4 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated last year
- Text to Speech Synthesis based on controllable latent representation☆14Updated 5 years ago
- Long audio alignment using Kaldi☆25Updated 3 years ago
- Tensor2tensor experiment with SpecAugment☆47Updated 5 years ago
- Source code for INTERSPEECH2020☆11Updated 4 years ago
- follow NVIDIA, simplify it and support data parallel.☆13Updated 4 years ago
- Script for converting kaldi GMM/HMM models to HTK format☆11Updated 2 months ago
- 2018/2019 TTS framework integrating state of the art open source methods☆47Updated 5 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆35Updated 4 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆51Updated 4 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Updated 4 years ago
- Google's TPGST reimplementation.☆34Updated 4 years ago
- VoxSRC Challenge☆31Updated 5 years ago
- A simple tool for using Montreal Forced Aligner easily. Also provides alignments (TextGrid) retrieved from ESD.☆42Updated last year
- using world vocoder to extract features and make data for training neural networks☆11Updated 6 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42Updated 5 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆24Updated 4 years ago
- VQVAE for Unsupervised Voice Conversion☆21Updated 5 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Updated 6 months ago