voidful / ipa2Links
Tools for convert Text to IPA in python
☆17Updated 2 years ago
Alternatives and similar repositories for ipa2
Users that are interested in ipa2 are comparing it to the libraries listed below
Sorting:
- Finetuning VITS Efficiently☆33Updated last year
- Cantonese Text to Speech with VITS implementation☆30Updated 2 years ago
- Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to the vowel of the syllable.☆45Updated 2 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆99Updated 8 months ago
- Putting flows on top of neural transducers for better TTS☆62Updated this week
- Grapheme-to-Phoneme lexicons for Chinese dialects☆69Updated 2 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆33Updated last year
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Updated 2 years ago
- Pre-trained grapheme-to-phoneme (G2P) models☆25Updated 3 years ago
- Chinese and English Bilinguish G2P☆21Updated last year
- Repository for the paper: VoiceMe: Personalized voice generation in TTS☆125Updated 3 years ago
- ☆41Updated 2 years ago
- ☆22Updated 3 years ago
- one script for xls-r/xlsr/whisper fine-tuning☆42Updated last year
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)☆20Updated 2 years ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆31Updated 10 months ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Updated last year
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆92Updated last year
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆46Updated 11 months ago
- Convert English text from written expressions into spoken forms☆25Updated 3 years ago
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆39Updated last year
- ☆11Updated last year
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- ☆20Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated 2 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated 2 years ago
- Non Parallel Voice Conversion based on VITS☆24Updated 2 years ago
- End-To-End SpeechSynthesis system with knowledge distillation☆16Updated 2 years ago
- visual-text to speech☆14Updated 3 years ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆24Updated last year