chdzq / ARPAbetAndIPAConvertor
☆64Updated last year
Related projects ⓘ
Alternatives and complementary repositories for ARPAbetAndIPAConvertor
- ☆110Updated 2 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages☆130Updated 7 months ago
- MelGAN implementation with Multi-Band and Full Band supports...☆60Updated 4 years ago
- Chinese Text Normalization and Dataset☆81Updated 2 years ago
- Predict prosody labels for Chinese sentences.☆40Updated 2 years ago
- ☆51Updated 5 years ago
- VAE Tacotron 2, an alternative of GST Tacotron☆88Updated last year
- ☆74Updated 2 years ago
- ☆77Updated 6 months ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆154Updated 3 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆109Updated 2 years ago
- the Tensorflow version of multi-speaker TTS training with feedback constraint☆40Updated 4 years ago
- Tiny Transducer: A Highly-Efficient Speech Recognition Model on Edge Devices☆19Updated 2 years ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆87Updated 2 years ago
- ☆69Updated 3 years ago
- Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet☆61Updated 3 years ago
- Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech☆93Updated 2 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 4 years ago
- [ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆55Updated 4 years ago
- A PyTorch implementation of the universal neural vocoder☆67Updated 4 years ago
- An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.☆53Updated 2 years ago
- End-to-end spoken language identification out of the box.☆48Updated 3 years ago
- Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆83Updated last year
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆66Updated 7 months ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions☆218Updated 3 months ago
- TTS-frontend with Bert and CRF/lstm (For Tacotron)☆50Updated 4 years ago
- Charsiu: A neural phonetic aligner.☆278Updated 2 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆81Updated last year
- Implementation of the AlignTTS☆76Updated last year