lingjzhu / CharsiuG2PView external linksLinks
Multilingual G2P in 100 languages
☆374May 26, 2023Updated 2 years ago
Alternatives and similar repositories for CharsiuG2P
Users that are interested in CharsiuG2P are comparing it to the libraries listed below
Sorting:
- Charsiu: A neural phonetic aligner.☆329Sep 19, 2022Updated 3 years ago
- Grapheme to phoneme conversion with deep learning.☆420Dec 8, 2023Updated 2 years ago
- Official implementation of the source-filter HiFiGAN vocoder☆268Jul 29, 2023Updated 2 years ago
- XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)☆347Jul 22, 2024Updated last year
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆174Jun 9, 2023Updated 2 years ago
- [ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"☆366Sep 3, 2024Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆106Oct 9, 2024Updated last year
- Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023☆251Jun 5, 2025Updated 8 months ago
- Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)☆121Jan 24, 2023Updated 3 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆122Jul 14, 2022Updated 3 years ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions☆267Jan 13, 2025Updated last year
- Pytorch implementation of BigVSAN☆203Dec 9, 2025Updated 2 months ago
- Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)☆153Feb 1, 2023Updated 3 years ago
- A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset☆361Dec 24, 2021Updated 4 years ago
- A differentiable version of SPTK☆192Feb 3, 2026Updated last week
- Train the next generation of TTS systems.☆171Sep 13, 2024Updated last year
- ICASSP 2023 Accepted☆189May 6, 2024Updated last year
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, …☆291Apr 6, 2023Updated 2 years ago
- ☆88Nov 1, 2022Updated 3 years ago
- An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"☆137Aug 17, 2023Updated 2 years ago
- HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform☆241Jan 14, 2025Updated last year
- Official implementation of SawSing (ISMIR'22)☆272Aug 28, 2022Updated 3 years ago
- ☆163Sep 19, 2022Updated 3 years ago
- ☆259May 15, 2023Updated 2 years ago
- An Open-source Streaming High-fidelity Neural Audio Codec☆498Mar 4, 2025Updated 11 months ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆190Jan 26, 2026Updated 2 weeks ago
- ☆46Apr 16, 2023Updated 2 years ago
- Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.☆223Oct 20, 2023Updated 2 years ago
- PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline☆432Apr 19, 2023Updated 2 years ago
- multilingual speech aligner☆76Nov 19, 2023Updated 2 years ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models☆47Mar 25, 2022Updated 3 years ago
- Adaptive Vocoder for Custom Voice☆61Sep 22, 2022Updated 3 years ago
- ☆197May 3, 2024Updated last year
- A repository for benchmarking neural vocoders by their quality and speed.☆212May 30, 2025Updated 8 months ago
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning☆159Jun 13, 2024Updated last year
- Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis☆1,064Aug 7, 2024Updated last year
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform☆468Nov 17, 2022Updated 3 years ago
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆78Dec 3, 2024Updated last year
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64May 30, 2023Updated 2 years ago