chorusai / arpa2ipa
A converter from Arpabet to IPA (see https://en.wikipedia.org/wiki/Arpabet)
☆17Updated 7 years ago
Alternatives and similar repositories for arpa2ipa:
Users that are interested in arpa2ipa are comparing it to the libraries listed below
- ☆79Updated 7 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆107Updated last year
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS☆21Updated 2 years ago
- multilingual speech aligner☆73Updated last year
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆88Updated 2 years ago
- Materials accompanying the paper "Phonological features for 0-shot multilingual speech synthesis"☆32Updated 4 years ago
- Chinese and English Bilinguish G2P☆20Updated last year
- Implementation of StyleTTS for Mandarin☆11Updated last year
- ☆111Updated 2 years ago
- Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to the vowel of the syllable.☆32Updated 7 months ago
- Chinese Text Normalization and Dataset☆81Updated 2 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆120Updated 2 years ago
- Text frontend for ESPnet tts recipes☆31Updated 3 years ago
- Convert English text from written expressions into spoken forms☆22Updated 2 years ago
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning☆118Updated 7 months ago
- [ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations☆138Updated 8 months ago
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆48Updated 6 months ago
- MFA acoustic model training based on Opencpop☆12Updated 2 years ago
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆92Updated 11 months ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆113Updated 2 years ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆25Updated 6 months ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"☆115Updated 3 years ago
- Neural network-based forced alignment with bidirectional attention mechanism☆71Updated this week
- Colab notebooks for Next-gen Kaldi☆26Updated last month
- Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021☆39Updated 3 years ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆48Updated 8 months ago
- ☆70Updated last year
- Huawei Grad-TTS for Chinese☆45Updated last year
- ☆19Updated last year