speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription
☆410Apr 8, 2020Updated 5 years ago
Alternatives and similar repositories for speech-aligner
Users that are interested in speech-aligner are comparing it to the libraries listed below
Sorting:
- DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)☆301Jun 15, 2020Updated 5 years ago
- g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese☆243Jul 10, 2019Updated 6 years ago
- A python module that convert chinese written string to read string. 一个python包:将中文书面字符串转换为口语字符串。☆124Oct 8, 2019Updated 6 years ago
- Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.☆262Oct 11, 2019Updated 6 years ago
- Chinese text normalization for speech processing☆721Mar 18, 2023Updated 2 years ago
- Command line utility for forced alignment using Kaldi☆1,752Updated this week
- A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset☆361Dec 24, 2021Updated 4 years ago
- Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.☆229Aug 17, 2020Updated 5 years ago
- A Demo of Mandarin/Chinese TTS frontend☆285Apr 18, 2022Updated 3 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.☆184Aug 12, 2020Updated 5 years ago
- The Implementation of FastSpeech based on pytorch.☆880Jul 6, 2023Updated 2 years ago
- ☆76Mar 18, 2022Updated 3 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆110Jun 22, 2022Updated 3 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆37Apr 12, 2018Updated 7 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- g2p: English Grapheme To Phoneme Conversion☆911Jan 5, 2023Updated 3 years ago
- A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis☆205Nov 6, 2018Updated 7 years ago
- ☆34Jul 16, 2019Updated 6 years ago
- Tacotron text to speech in C++(synthesize only)☆77Oct 17, 2019Updated 6 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Nov 14, 2020Updated 5 years ago
- Tools for ASR Corpus Generation from Online Video☆140Feb 10, 2019Updated 7 years ago
- ☆45Oct 24, 2020Updated 5 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆132Nov 29, 2020Updated 5 years ago
- Implementation of the AlignTTS☆77Jul 6, 2023Updated 2 years ago
- a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi☆344Dec 25, 2020Updated 5 years ago
- A Pytorch implementation for the ZeroSpeech 2019 challenge.☆112Nov 12, 2019Updated 6 years ago
- A Pytorch Implementation of ClariNet☆292Aug 5, 2019Updated 6 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆157Jul 2, 2021Updated 4 years ago
- A collection of links and notes on forced alignment tools☆935Nov 10, 2021Updated 4 years ago
- Efficient neural speech synthesis☆1,203Sep 21, 2024Updated last year
- End-2-end speech synthesis with recurrent neural networks☆223Feb 24, 2024Updated 2 years ago
- WaveNet-Vocoder implementation with pytorch.☆300Jun 8, 2020Updated 5 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- A WaveRNN implementation☆201Oct 14, 2019Updated 6 years ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"☆116Dec 22, 2021Updated 4 years ago
- Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS☆168Apr 10, 2024Updated last year
- A Python module for interacting with Praat TextGrid files. Also includes a class for reading HTK .mlf files into Praat☆298Nov 8, 2023Updated 2 years ago
- A CRF-based ASR Toolkit☆362Feb 5, 2026Updated 3 weeks ago