基于语言学本体构建,全面覆盖汉语多音字、音变等现象的高效中文TTS数据集。A linguistically grounded and comprehensive Chinese TTS dataset, efficiently covering Chinese polyphonic characters, phonological changes, and more.
☆55Aug 13, 2024Updated last year
Alternatives and similar repositories for Chinese-TTS-Dataset
Users that are interested in Chinese-TTS-Dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆103Feb 5, 2024Updated 2 years ago
- C++版本的汉字转拼音 Transfer chinese character to pinyin☆15Aug 31, 2018Updated 7 years ago
- ☆23Oct 30, 2024Updated last year
- [ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"☆371Sep 3, 2024Updated last year
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆20Mar 12, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆78Nov 1, 2024Updated last year
- This is the code of the paper "SpectrumFM: A Foundation Model for Intelligent Spectrum Management"☆35Dec 29, 2025Updated 3 months ago
- personal blog☆18Jun 8, 2022Updated 3 years ago
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated last year
- [ACL 2026 Main] MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows☆131Sep 2, 2025Updated 7 months ago
- Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)☆390Jun 21, 2025Updated 9 months ago
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 8 months ago
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆19Jun 6, 2025Updated 10 months ago
- A pakage for crawling audio from Youtube☆42Aug 8, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The official PyTorch implementation of VM-ASR, a model designed for high-fidelity audio super-resolution.☆21Sep 8, 2025Updated 7 months ago
- ASLP Summer Inter@NPU☆12Jul 30, 2024Updated last year
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆110May 20, 2025Updated 10 months ago
- CosyVoice_DPO_NOTES: Supercharge Your Cosyvoice model with Cutting-Edge DPO Fine-Tuning!☆124Aug 8, 2025Updated 8 months ago
- ☆179Aug 25, 2025Updated 7 months ago
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3☆435Sep 13, 2024Updated last year
- Grapheme-to-Phoneme lexicons for Chinese dialects☆70Nov 20, 2022Updated 3 years ago
- Deep Articulatory Synthesis and Inversion☆55Feb 14, 2024Updated 2 years ago
- ☆132Apr 6, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Clone of the mp3gain sources from svn on sourceforge (http://mp3gain.sourceforge.net/)☆11Jan 3, 2013Updated 13 years ago
- Code for calculate DNS_MOS.☆43Dec 18, 2022Updated 3 years ago
- ☆49May 3, 2020Updated 5 years ago
- flow mirror models from JZX AI Labs☆43Sep 30, 2024Updated last year
- ☆43Feb 8, 2025Updated last year
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- ☆22Jul 10, 2025Updated 9 months ago
- Datasets of A Deep Convolutional Neural Network Based Virtual Elderly Companion Agent.☆38Jul 13, 2018Updated 7 years ago
- Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice☆509Dec 22, 2025Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- api document for www.xt.com , www.xt.pub etc☆10Jun 17, 2022Updated 3 years ago
- frameworks_base for Geeksphone Peak and Keon☆12Jan 13, 2015Updated 11 years ago
- Split up any kind of Pinyin into an array of syllables.☆11Aug 14, 2024Updated last year
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Mar 27, 2024Updated 2 years ago
- ☆14Jan 2, 2025Updated last year
- Official implementation of NeurIPS'24 paper Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features☆38May 28, 2025Updated 10 months ago
- 基于DPO算法微调语言大模型,简单好上手。☆51Jul 3, 2024Updated last year