Connum / npm-pinyin2ipa
Converts Mandarin Chinese pinyin notation to IPA (international phonetic alphabet) notation
☆16Updated last year
Alternatives and similar repositories for npm-pinyin2ipa:
Users that are interested in npm-pinyin2ipa are comparing it to the libraries listed below
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆19Updated 2 years ago
- Mutiband version of HIFIGAN☆18Updated 4 years ago
- Torchaudio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆11Updated 4 months ago
- Just another FastSpeech 2 but cleaner code :)☆26Updated 9 months ago
- Pre-trained grapheme-to-phoneme (G2P) models☆25Updated 3 years ago
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Updated last year
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Updated 2 years ago
- ☆25Updated 3 years ago
- EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System☆15Updated 6 years ago
- The implementation of g2pL with a new open dataset.☆16Updated last year
- ☆19Updated 2 years ago
- Chinese polyphone disambiguation for Text-to-Speech application☆34Updated 10 months ago
- ☆31Updated last year
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- Singing Voice Speech modeling test☆35Updated 2 years ago
- Megatts2 use HierSpeechpp's vocoder☆18Updated 4 months ago
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆16Updated 11 months ago
- ☆18Updated 7 months ago
- Convert English text from written expressions into spoken forms☆25Updated 2 years ago
- ☆25Updated 2 years ago
- ☆12Updated 2 months ago
- Forced alignment decoder for Whisper.☆14Updated last year
- This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.☆14Updated 3 years ago
- ☆13Updated 3 years ago
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆27Updated last year
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆16Updated 3 weeks ago
- with alignment learning and continuous wavelet transform☆20Updated 2 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 3 years ago
- ☆22Updated 2 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year