chorusai / arpa2ipaLinks
A converter from Arpabet to IPA (see https://en.wikipedia.org/wiki/Arpabet)
☆15Updated 7 years ago
Alternatives and similar repositories for arpa2ipa
Users that are interested in arpa2ipa are comparing it to the libraries listed below
Sorting:
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆116Updated 2 years ago
- ☆80Updated last year
- Interface for Controllable Expressive Talking Machine☆38Updated last year
- Labeled data for homograph disambiguation☆59Updated 2 years ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆88Updated 3 years ago
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.☆36Updated last year
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS☆24Updated 3 years ago
- ☆111Updated 3 years ago
- Materials accompanying the paper "Phonological features for 0-shot multilingual speech synthesis"☆33Updated 5 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆38Updated this week
- Collection of pretrained models for the Montreal Forced Aligner☆158Updated last month
- An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.☆53Updated 2 years ago
- Collect Voice Conversion researches☆93Updated this week
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆201Updated 2 years ago
- MFA acoustic model training based on Opencpop☆15Updated 2 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆84Updated 2 years ago
- Monotonic Alignment Search☆96Updated 2 months ago
- Predict prosody labels for Chinese sentences.☆41Updated 3 years ago
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆132Updated last year
- Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021☆40Updated 4 years ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated last year
- LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search☆92Updated 3 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆137Updated 2 years ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆86Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆101Updated 2 years ago
- Neural network-based forced alignment with bidirectional attention mechanism☆77Updated 6 months ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆123Updated 3 years ago
- [ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations☆140Updated last year
- Chinese Text Normalization and Dataset☆84Updated 3 years ago