chorusai / arpa2ipa
A converter from Arpabet to IPA (see https://en.wikipedia.org/wiki/Arpabet)
☆16Updated 7 years ago
Alternatives and similar repositories for arpa2ipa
Users that are interested in arpa2ipa are comparing it to the libraries listed below
Sorting:
- ☆80Updated 11 months ago
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS☆23Updated 3 years ago
- ☆71Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆114Updated 2 years ago
- MFA acoustic model training based on Opencpop☆14Updated 2 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆33Updated this week
- Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to the vowel of the syllable.☆43Updated last month
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 7 months ago
- [ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations☆140Updated last year
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆88Updated 2 years ago
- ☆33Updated last year
- Convert English text from written expressions into spoken forms☆25Updated 2 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆123Updated 2 years ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Updated last year
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.☆36Updated 2 years ago
- Implementation of StyleTTS for Mandarin☆11Updated last year
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆97Updated last year
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated last year
- ☆112Updated 3 years ago
- ☆26Updated 3 months ago
- Collection of pretrained models for the Montreal Forced Aligner☆148Updated 10 months ago
- Materials accompanying the paper "Phonological features for 0-shot multilingual speech synthesis"☆33Updated 4 years ago
- Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021☆39Updated 3 years ago
- NVIDIA's FastPitch, extracted from the DeepLearningExamples repository☆13Updated last year
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network☆45Updated 3 years ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆85Updated 2 years ago
- Colab notebooks for Next-gen Kaldi☆27Updated last month
- the Tensorflow version of multi-speaker TTS training with feedback constraint☆40Updated 4 years ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆119Updated 2 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆56Updated 3 years ago