Jackiexiao / tts-frontend-datasetView external linksLinks
TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization
☆103Feb 5, 2024Updated 2 years ago
Alternatives and similar repositories for tts-frontend-dataset
Users that are interested in tts-frontend-dataset are comparing it to the libraries listed below
Sorting:
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆114Dec 2, 2025Updated 2 months ago
- Production First and Production Ready End-to-End Text-to-Speech Toolkit☆415Nov 20, 2025Updated 2 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆62Nov 1, 2024Updated last year
- Unoffical implementation of Megatts2☆288Mar 23, 2024Updated last year
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆61Apr 4, 2024Updated last year
- [ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"☆366Sep 3, 2024Updated last year
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3☆433Sep 13, 2024Updated last year
- Train the next generation of TTS systems.☆171Sep 13, 2024Updated last year
- ☆25Jan 24, 2023Updated 3 years ago
- ☆140Jan 7, 2024Updated 2 years ago
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- ☆19Mar 22, 2024Updated last year
- ☆55Jan 13, 2023Updated 3 years ago
- All generative model in one for better TTS model☆74Sep 8, 2024Updated last year
- ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model☆211Apr 26, 2024Updated last year
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆146Aug 22, 2022Updated 3 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆40Jul 10, 2023Updated 2 years ago
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 6 months ago
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆184Mar 6, 2024Updated last year
- ☆111Apr 6, 2022Updated 3 years ago
- text to speech☆10Mar 19, 2024Updated last year
- Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training☆135Feb 18, 2023Updated 3 years ago
- Chinese polyphone disambiguation for Text-to-Speech application☆42Jun 11, 2024Updated last year
- My vocoder experiments☆31Jul 26, 2025Updated 6 months ago
- RepVgg + HiFiGAN☆36Aug 10, 2022Updated 3 years ago
- ☆59Oct 22, 2025Updated 3 months ago
- HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform☆241Jan 14, 2025Updated last year
- ☆275Jun 8, 2024Updated last year
- ☆69May 19, 2023Updated 2 years ago
- Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Pr…☆235Jul 3, 2024Updated last year
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech☆237Feb 29, 2024Updated last year
- Pytorch implementation of BigVSAN☆203Dec 9, 2025Updated 2 months ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆87Dec 20, 2024Updated last year
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE☆88Apr 2, 2024Updated last year
- High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec☆115Jun 23, 2025Updated 7 months ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆111Apr 1, 2024Updated last year
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…☆111Dec 20, 2024Updated last year
- CosyVoice_DPO_NOTES: Supercharge Your Cosyvoice model with Cutting-Edge DPO Fine-Tuning!☆116Aug 8, 2025Updated 6 months ago