chorusai / arpa2ipa
A converter from Arpabet to IPA (see https://en.wikipedia.org/wiki/Arpabet)
☆15 Updated 7 years ago
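To illustrate what an Arpabet-to-IPA conversion involves, here is a minimal sketch in Python. It is not the arpa2ipa package's API: the names `ARPABET_TO_IPA` and `arpabet_to_ipa` are hypothetical, the table follows the common CMUdict-style symbol set, and stress digits are simply dropped (except that unstressed `AH0` is rendered as a schwa), which is a simplifying assumption.

```python
# Hypothetical illustration only; this is NOT the arpa2ipa package's API.
# Symbol table for the common two-letter (CMUdict-style) Arpabet set.
ARPABET_TO_IPA = {
    # Vowels
    "AA": "ɑ", "AE": "æ", "AH": "ʌ", "AO": "ɔ", "AW": "aʊ",
    "AY": "aɪ", "EH": "ɛ", "ER": "ɝ", "EY": "eɪ", "IH": "ɪ",
    "IY": "i", "OW": "oʊ", "OY": "ɔɪ", "UH": "ʊ", "UW": "u",
    # Consonants
    "B": "b", "CH": "tʃ", "D": "d", "DH": "ð", "F": "f",
    "G": "ɡ", "HH": "h", "JH": "dʒ", "K": "k", "L": "l",
    "M": "m", "N": "n", "NG": "ŋ", "P": "p", "R": "ɹ",
    "S": "s", "SH": "ʃ", "T": "t", "TH": "θ", "V": "v",
    "W": "w", "Y": "j", "Z": "z", "ZH": "ʒ",
}


def arpabet_to_ipa(phones: str) -> str:
    """Convert a space-separated Arpabet string to an IPA string.

    Assumption: stress digits (0, 1, 2) are discarded, except that
    unstressed AH (AH0) is mapped to the schwa.
    """
    out = []
    for token in phones.upper().split():
        base = token.rstrip("012")      # strip a trailing stress digit
        stress = token[len(base):]      # "", "0", "1" or "2"
        if base not in ARPABET_TO_IPA:
            raise ValueError(f"unknown Arpabet symbol: {token!r}")
        symbol = ARPABET_TO_IPA[base]
        if base == "AH" and stress == "0":
            symbol = "ə"                # unstressed AH is a schwa
        out.append(symbol)
    return "".join(out)


if __name__ == "__main__":
    print(arpabet_to_ipa("HH AH0 L OW1"))
    print(arpabet_to_ipa("K AE1 T"))
```

A lookup table keeps the mapping easy to audit against the Arpabet chart linked above; a fuller converter would also emit IPA stress marks and handle the less common one-letter symbol set.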
Alternatives and similar repositories for arpa2ipa
Users that are interested in arpa2ipa are comparing it to the libraries listed below
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆118 Updated 2 years ago
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS ☆24 Updated 3 years ago
- [ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations ☆140 Updated last year
- ☆80 Updated last month
- Labeled data for homograph disambiguation ☆59 Updated 2 years ago
- MFA acoustic model training based on Opencpop ☆15 Updated 2 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training ☆140 Updated 3 years ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment) ☆124 Updated 3 years ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English. ☆107 Updated 6 months ago
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization ☆102 Updated last year
- ☆111 Updated 3 years ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation. ☆87 Updated 3 years ago
- ☆71 Updated 2 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis ☆84 Updated 2 years ago
- Chinese and English Bilingual G2P ☆21 Updated 2 years ago
- Chinese Text Normalization and Dataset ☆85 Updated 3 years ago
- Materials accompanying the paper "Phonological features for 0-shot multilingual speech synthesis" ☆33 Updated 5 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆107 Updated 2 years ago
- Train the next generation of TTS systems. ☆167 Updated last year
- Interface for Controllable Expressive Talking Machine ☆38 Updated last year
- An implementation of the GlowTTS model. Several modes are added: speaker embedding, prosody encoder (GST), and gradient reversal. ☆53 Updated 3 years ago
- Collection of pretrained models for the Montreal Forced Aligner ☆165 Updated 3 months ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion ☆34 Updated last year
- ☆33 Updated 2 years ago
- Phoneme tokenizer and grapheme-to-phoneme model for 8k languages ☆170 Updated 2 years ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2 ☆52 Updated last year
- Implementation of StyleTTS for Mandarin ☆11 Updated 2 years ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake… ☆57 Updated 2 years ago
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA) ☆96 Updated last year
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup ☆76 Updated 2 months ago