errolyan / text_normalization_CHLinks
TTS前,文本标准化,将数字字母处理转化为汉字
☆12Updated last year
Alternatives and similar repositories for text_normalization_CH
Users that are interested in text_normalization_CH are comparing it to the libraries listed below
Sorting:
- Audio samples from ICML2019 "Almost Unsupervised Text to Speech and Automatic Speech Recognition"☆17Updated 6 years ago
 - PyTorch implementation of Retriever: Learning Content-Style Representation☆12Updated 2 years ago
 - speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆15Updated 6 years ago
 - ☆25Updated 3 years ago
 - 中文句子韵律分析&停顿推断☆10Updated 6 years ago
 - python wrap for hts engine☆14Updated 7 years ago
 - WIP Tensorflow implementation of https://github.com/mozilla/TTS☆15Updated 5 years ago
 - Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Updated 3 years ago
 - 一个开源的中文歌声合成数据集。An open-source Chinese singing synthesizing dataset.☆23Updated 6 years ago
 - ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆36Updated 7 years ago
 - Simulation of parallel synthesis with LPCNet vocoder☆14Updated 5 years ago
 - unsupervised ASR (mainly phone classifier) using EODM and GAN☆12Updated 5 years ago
 - Phonemes and durations labeling based on whisper small☆11Updated last year
 - Arxiv automatically obtains the latest article service.☆11Updated 5 years ago
 - Some script for helping using Montreal Forced Aligner, maily for transforming Hanzi character to pinyin and extrat pause time from .textg…☆14Updated last year
 - Unsupervised spoken sentence embeddings☆14Updated 2 years ago
 - ☆15Updated 6 years ago
 - wake-up word emotion recognition [APSIPA 2022]☆17Updated 2 years ago
 - A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆15Updated 11 months ago
 - Code for "Distribution-based Emotion Recognition in Conversation"☆19Updated 2 years ago
 - Open Source Speech/Text Data on AI☆18Updated 3 years ago
 - Google's TPGST reimplementation.☆34Updated 5 years ago
 - ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Updated 4 years ago
 - A Tensorflow Implementation like "Neural Speech Synthesis with Transformer Network" Port From OpenSeq2Seq☆20Updated 2 years ago
 - using world vocoder to extract features and make data for training neural networks☆11Updated 8 years ago
 - ☆14Updated 2 years ago
 - An imporved version of Fastsinging singing voice synthesising system.☆20Updated 5 years ago
 - ☆19Updated 2 years ago
 - ☆18Updated 3 years ago
 - ☆11Updated 3 years ago