基于语言学本体构建,全面覆盖汉语多音字、音变等现象的高效中文TTS数据集。A linguistically grounded and comprehensive Chinese TTS dataset, efficiently covering Chinese polyphonic characters, phonological changes, and more.
☆54Aug 13, 2024Updated last year
Alternatives and similar repositories for Chinese-TTS-Dataset
Users that are interested in Chinese-TTS-Dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆103Feb 5, 2024Updated 2 years ago
- C++版本的汉字转拼音 Transfer chinese character to pinyin☆15Aug 31, 2018Updated 7 years ago
- ☆23Oct 30, 2024Updated last year
- [ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"☆372Sep 3, 2024Updated last year
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆20Mar 12, 2026Updated 2 weeks ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆78Nov 1, 2024Updated last year
- This is the code of the paper "SpectrumFM: A Foundation Model for Intelligent Spectrum Management"☆27Dec 29, 2025Updated 2 months ago
- personal blog☆18Jun 8, 2022Updated 3 years ago
- MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows☆130Sep 2, 2025Updated 6 months ago
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated 11 months ago
- Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)☆387Jun 21, 2025Updated 9 months ago
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 8 months ago
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆19Jun 6, 2025Updated 9 months ago
- A pakage for crawling audio from Youtube☆42Aug 8, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The official PyTorch implementation of VM-ASR, a model designed for high-fidelity audio super-resolution.☆21Sep 8, 2025Updated 6 months ago
- ASLP Summer Inter@NPU☆12Jul 30, 2024Updated last year
- A reviewed paper list about applying deep learning models for smarter transportation systems☆12Sep 15, 2020Updated 5 years ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆110May 20, 2025Updated 10 months ago
- CosyVoice_DPO_NOTES: Supercharge Your Cosyvoice model with Cutting-Edge DPO Fine-Tuning!☆122Aug 8, 2025Updated 7 months ago
- ☆175Aug 25, 2025Updated 7 months ago
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3☆435Sep 13, 2024Updated last year
- Grapheme-to-Phoneme lexicons for Chinese dialects☆70Nov 20, 2022Updated 3 years ago
- ☆130Mar 2, 2026Updated 3 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆89Dec 28, 2025Updated 2 months ago
- Clone of the mp3gain sources from svn on sourceforge (http://mp3gain.sourceforge.net/)☆11Jan 3, 2013Updated 13 years ago
- ☆49May 3, 2020Updated 5 years ago
- flow mirror models from JZX AI Labs☆43Sep 30, 2024Updated last year
- ☆43Feb 8, 2025Updated last year
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, i…☆166Mar 6, 2026Updated 2 weeks ago
- Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice☆508Dec 22, 2025Updated 3 months ago
- Datasets of A Deep Convolutional Neural Network Based Virtual Elderly Companion Agent.☆38Jul 13, 2018Updated 7 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- frameworks_base for Geeksphone Peak and Keon☆12Jan 13, 2015Updated 11 years ago
- api document for www.xt.com , www.xt.pub etc☆10Jun 17, 2022Updated 3 years ago
- Split up any kind of Pinyin into an array of syllables.☆11Aug 14, 2024Updated last year
- ☆13Jan 2, 2025Updated last year
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Mar 27, 2024Updated last year
- Official implementation of NeurIPS'24 paper Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features☆38May 28, 2025Updated 9 months ago
- 基于DPO算法微调语言大模型,简单好上手。☆51Jul 3, 2024Updated last year