chorusai / arpa2ipaLinks
A converter from Arpabet to IPA (see https://en.wikipedia.org/wiki/Arpabet)
☆15Updated 7 years ago
Alternatives and similar repositories for arpa2ipa
Users that are interested in arpa2ipa are comparing it to the libraries listed below
Sorting:
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- ☆80Updated 4 months ago
- Collection of pretrained models for the Montreal Forced Aligner☆180Updated 2 months ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆124Updated 3 years ago
- ☆111Updated 3 years ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆87Updated 3 years ago
- [ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations☆140Updated last year
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS☆24Updated 3 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆145Updated 3 years ago
- scripts to align a given wave to its transcription using trained models by Kaldi☆35Updated 6 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆107Updated 2 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆172Updated 2 years ago
- Collect Voice Conversion researches☆96Updated this week
- multilingual speech aligner☆77Updated 2 years ago
- Monotonic Alignment Search☆100Updated 6 months ago
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)☆97Updated 2 years ago
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆102Updated last year
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E☆135Updated last year
- Train the next generation of TTS systems.☆170Updated last year
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.☆42Updated 3 months ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆183Updated 2 weeks ago
- Toolbox for easy and qualitative one-shot voice conversion☆46Updated 4 years ago
- ☆69Updated last year
- The EveryVoice TTS Toolkit - Text To Speech for your language☆41Updated 2 weeks ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆122Updated 3 years ago
- Timething is a library for aligning text transcripts with their audio recordings.☆126Updated last year
- Labeled data for homograph disambiguation☆62Updated 2 years ago
- Interface for Controllable Expressive Talking Machine☆39Updated 3 months ago
- An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.☆54Updated 3 years ago
- Neural network-based forced alignment with bidirectional attention mechanism☆78Updated 11 months ago