hongwen-sun / speech-aligner
speech-aligner is a tool that generates phoneme-level time alignments between human speech and its transcription.
☆15 · Updated 7 years ago
Alternatives and similar repositories for speech-aligner
Users who are interested in speech-aligner are comparing it to the libraries listed below:
- ☆21 · Updated 5 years ago
- ChiNese Text Normalization (CNTN) tool for text-to-speech systems ☆37 · Updated 7 years ago
- ☆44 · Updated 5 years ago
- ☆45 · Updated 6 years ago
- magicspeech competition recipe ☆18 · Updated 5 years ago
- Open Source Speech/Text Data on AI ☆18 · Updated 3 years ago
- An ASR decoder and make-graph project ☆33 · Updated 3 years ago
- ☆61 · Updated 2 years ago
- Kaldi extended by Kaituo XU with new features in nnet1. ☆12 · Updated 7 years ago
- A SPMI Lab toolkit for language models. ☆11 · Updated 8 years ago
- C++ code for Merlin TTS ☆22 · Updated 6 years ago
- Text frontend for ESPnet TTS recipes ☆34 · Updated 4 years ago
- A Kaldi/C++ implementation of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition ☆14 · Updated 6 years ago
- Google's TPGST reimplementation. ☆34 · Updated 6 years ago
- An improved version of the Fastsinging singing voice synthesis system. ☆20 · Updated 5 years ago
- TensorFlow implementation of WaveGlow ☆37 · Updated 5 years ago
- ☆41 · Updated 7 years ago
- Tacotron text to speech in C++ (synthesize only) ☆77 · Updated 6 years ago
- ☆31 · Updated 7 years ago
- A Chinese prosody prediction model based on random forests and conditional random fields ☆28 · Updated last year
- Transformer-based ASR engine. ☆13 · Updated 4 years ago
- An extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.… ☆11 · Updated 5 years ago
- TTS model based on Transformer. ☆58 · Updated 6 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system ☆36 · Updated 6 years ago
- ☆33 · Updated 4 years ago
- ☆51 · Updated 6 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also supports reading/writing ark/scp files ☆55 · Updated 3 months ago
- ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for… ☆43 · Updated 5 years ago
- C++ implementation of an end-to-end TTS that combines Tacotron2 and the LPCNet vocoder. ☆32 · Updated 6 years ago
- Integration of FastSpeech text-to-mel generation and the fast vocoder SqueezeWave ☆20 · Updated 2 years ago