☆77Apr 26, 2022Updated 3 years ago
Alternatives and similar repositories for SpanPSP
Users that are interested in SpanPSP are comparing it to the libraries listed below
Sorting:
- Chinese Text Normalization and Dataset☆91May 14, 2022Updated 3 years ago
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆103Feb 5, 2024Updated 2 years ago
- ☆111Apr 6, 2022Updated 3 years ago
- 基于PyTorch的VITS-BigVGAN的tts中文模型,加入韵律预测模型。☆197Sep 15, 2022Updated 3 years ago
- Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.☆194Jun 8, 2023Updated 2 years ago
- A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset☆361Dec 24, 2021Updated 4 years ago
- Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS☆168Apr 10, 2024Updated last year
- MFA acoustic model training based on Opencpop☆15Sep 23, 2022Updated 3 years ago
- text to speech☆10Mar 19, 2024Updated last year
- Predict prosody labels for Chinese sentences.☆41Jul 7, 2022Updated 3 years ago
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Mar 23, 2021Updated 4 years ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Nov 1, 2023Updated 2 years ago
- ☆25Mar 12, 2022Updated 3 years ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆87Dec 20, 2022Updated 3 years ago
- An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding".☆26Nov 4, 2023Updated 2 years ago
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…☆146Jun 6, 2022Updated 3 years ago
- ☆15Nov 11, 2024Updated last year
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆203Sep 4, 2022Updated 3 years ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago
- Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion☆20Jul 9, 2019Updated 6 years ago
- ☆259May 15, 2023Updated 2 years ago
- 基于vits fastspeech2 visinger的tts模型☆24Mar 9, 2023Updated 2 years ago
- An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"☆98Jun 7, 2022Updated 3 years ago
- An open-source Kazakh Emotional Text-to-Speech Dataset☆35Aug 1, 2025Updated 7 months ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆87Dec 20, 2024Updated last year
- ☆36Mar 14, 2025Updated 11 months ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions☆268Jan 13, 2025Updated last year
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Dec 2, 2022Updated 3 years ago
- My vocoder experiments☆31Jul 26, 2025Updated 7 months ago
- [ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations☆142Apr 27, 2024Updated last year
- Train the next generation of TTS systems.☆171Sep 13, 2024Updated last year
- g2p for english tts☆19Nov 10, 2022Updated 3 years ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆36Jan 17, 2024Updated 2 years ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆114Dec 2, 2025Updated 3 months ago
- ☆177Jul 9, 2024Updated last year
- PyTorch implementation of Tacotron and Tacotron2☆34Jul 19, 2022Updated 3 years ago
- 单独维护的中文TTS☆34Oct 28, 2022Updated 3 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆55Oct 15, 2021Updated 4 years ago
- ☆69Mar 31, 2021Updated 4 years ago