thuhcsi/FlatTN

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/thuhcsi/FlatTN)

thuhcsi / FlatTN

Chinese Text Normalization and Dataset

☆91

Alternatives and similar repositories for FlatTN

Users that are interested in FlatTN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

thuhcsi / SpanPSP
View on GitHub
☆76Apr 26, 2022Updated 4 years ago
Daisyqk / Automatic-Prosody-Annotation
View on GitHub
☆112Mar 9, 2026Updated 4 months ago
NeuroWave-ai / CUCVAE-TTS
View on GitHub
☆25Mar 12, 2022Updated 4 years ago
speechio / chinese_text_normalization
View on GitHub
Chinese text normalization for speech processing
☆734Mar 18, 2023Updated 3 years ago
yerfor / SyntaSpeech
View on GitHub
SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code
☆201Sep 4, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
thuhcsi / Crystal
View on GitHub
Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
☆230Aug 17, 2020Updated 5 years ago
kakaobrain / g2pm
View on GitHub
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
☆367Dec 24, 2021Updated 4 years ago
shang0712 / HierTTS
View on GitHub
☆47Apr 16, 2023Updated 3 years ago
adelacvg / ttts
View on GitHub
Train the next generation of TTS systems.
☆169Sep 13, 2024Updated last year
SungFeng-Huang / Meta-TTS
View on GitHub
Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.
☆192Jun 8, 2023Updated 3 years ago
keonlee9420 / Comprehensive-E2E-TTS
View on GitHub
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…
☆147Jun 6, 2022Updated 4 years ago
BoragoCode / AttentionBasedProsodyPrediction
View on GitHub
Encoder and Decoder and Attention Based Prosody Prediction
☆68Jan 17, 2018Updated 8 years ago
xcmyz / FastVocoder
View on GitHub
Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.
☆157Jul 2, 2021Updated 5 years ago
Liu-Feng-deeplearning / TTS-frontend
View on GitHub
TTS-frontend with Bert and CRF/lstm (For Tacotron)
☆53Jun 2, 2020Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lakahaga / dc-comix-tts
View on GitHub
Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer
☆74Aug 21, 2023Updated 2 years ago
vara-tts / VARA-TTS
View on GitHub
Demo audio of VARA-TTS model
☆20Jun 11, 2021Updated 5 years ago
thuhcsi / VAENAR-TTS
View on GitHub
The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.
☆144Jul 8, 2021Updated 5 years ago
makerjackie / tts-frontend-dataset
View on GitHub
TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization
☆104Feb 5, 2024Updated 2 years ago
mutiann / neural-lexicon-reader
View on GitHub
Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge
☆21Jul 25, 2022Updated 3 years ago
Kyubyong / g2pC
View on GitHub
g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese
☆246Jul 10, 2019Updated 7 years ago
Aria-K-Alethia / speaking-rate-controllable-hifi-gan
View on GitHub
☆16Apr 4, 2022Updated 4 years ago
p1an-lin-jung / wv_tts
View on GitHub
☆19Mar 22, 2024Updated 2 years ago
yuan1615 / AdaVocoder
View on GitHub
Adaptive Vocoder for Custom Voice
☆61Sep 22, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
burchim / EfficientConformer
View on GitHub
[ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition
☆221Jun 22, 2023Updated 3 years ago
Takaaki-Saeki / zm-text-tts
View on GitHub
[IJCAI'23] Learning to Speak from Text for Low-Resource TTS
☆65May 30, 2023Updated 3 years ago
wenet-e2e / opencpop
View on GitHub
Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis
☆236Dec 10, 2025Updated 7 months ago
p0p4k / Matcha-TTS-2
View on GitHub
E2E TTS using Conditional Flow Matching (Experimental*)
☆71Nov 10, 2023Updated 2 years ago
Zeqiang-Lai / Prosody_Prediction
View on GitHub
Predict prosody labels for Chinese sentences.
☆42Jul 7, 2022Updated 4 years ago
hhguo / MSMC-TTS
View on GitHub
Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS
☆168Apr 10, 2024Updated 2 years ago
rishikksh20 / Phone-Level-Mixture-Density-Network-for-TTS
View on GitHub
Rich Prosody Diversity Modelling with Phone-level Mixture Density Network
☆45Dec 1, 2021Updated 4 years ago
brentspell / torch-yin
View on GitHub
Yin pitch estimator in PyTorch
☆119Nov 7, 2022Updated 3 years ago
tts-tutorial / icassp2022
View on GitHub
☆64May 23, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
yanggeng1995 / EATS
View on GitHub
A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speech
☆127Jul 16, 2020Updated 6 years ago
descriptinc / cargan
View on GitHub
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
☆193Dec 8, 2022Updated 3 years ago
rishikksh20 / MiniMax-TTS-pytorch
View on GitHub
Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report
☆47Sep 2, 2025Updated 10 months ago
keonlee9420 / STYLER
View on GitHub
Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl…
☆159Jun 5, 2025Updated last year
HLTSingapore / Emotional-Speech-Data
View on GitHub
This is the GitHub page for publicly available emotional speech data.
☆402Jan 6, 2022Updated 4 years ago
ivanvovk / durian-pytorch
View on GitHub
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.
☆184Aug 12, 2020Updated 5 years ago
microsoft / NeuralSpeech
View on GitHub
☆1,460Feb 11, 2024Updated 2 years ago