open-speech/speech-aligner

speech-aligner is a tool that generates phoneme-level time alignments between human speech and its transcription.

☆410

Alternatives and similar repositories for speech-aligner

Users interested in speech-aligner are comparing it to the libraries listed below.

aishell-foundation / DaCiDian
DaCiDian is an open-source Mandarin Chinese lexicon for automatic speech recognition (ASR)
☆301 · Jun 15, 2020 · Updated 5 years ago
Kyubyong / g2pC
g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese
☆243 · Jul 10, 2019 · Updated 6 years ago
open-speech / cn-text-normalizer
A Python module that converts written Chinese strings into their spoken (read-aloud) form.
☆124 · Oct 8, 2019 · Updated 6 years ago
speechio / BigCiDian
Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.
☆262 · Oct 11, 2019 · Updated 6 years ago
speechio / chinese_text_normalization
Chinese text normalization for speech processing
☆721 · Mar 18, 2023 · Updated 2 years ago
MontrealCorpusTools / Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
☆1,752 · Updated this week
kakaobrain / g2pm
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
☆361 · Dec 24, 2021 · Updated 4 years ago
thuhcsi / Crystal
Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
☆229 · Aug 17, 2020 · Updated 5 years ago
Jackiexiao / MTTS
A Demo of Mandarin/Chinese TTS frontend
☆285 · Apr 18, 2022 · Updated 3 years ago
ivanvovk / durian-pytorch
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.
☆184 · Aug 12, 2020 · Updated 5 years ago
xcmyz / FastSpeech
An implementation of FastSpeech based on PyTorch.
☆880 · Jul 6, 2023 · Updated 2 years ago
athena-team / athena-decoder
☆76 · Mar 18, 2022 · Updated 3 years ago
dipjyoti92 / SC-WaveRNN
Official PyTorch implementation of Speaker Conditional WaveRNN
☆110 · Jun 22, 2022 · Updated 3 years ago
candlewill / CNTN
ChiNese Text Normalization (CNTN) tool for Text-to-speech system
☆37 · Apr 12, 2018 · Updated 7 years ago
r9y9 / icassp2020-espnet-tts-merlin-baseline
ICASSP 2020 ESPnet-TTS: Merlin baseline system
☆36 · Oct 28, 2019 · Updated 6 years ago
Kyubyong / g2p
g2p: English Grapheme To Phoneme Conversion
☆911 · Jan 5, 2023 · Updated 3 years ago
npuichigo / waveglow
A PyTorch implementation of WaveGlow: A Flow-based Generative Network for Speech Synthesis
☆205 · Nov 6, 2018 · Updated 7 years ago
yanggeng1995 / WaveRNN
☆34 · Jul 16, 2019 · Updated 6 years ago
syoyo / tacotron-tts-cpp
Tacotron text to speech in C++ (synthesize only)
☆77 · Oct 17, 2019 · Updated 6 years ago
bshall / UniversalVocoding
A PyTorch implementation of "Robust Universal Neural Vocoding"
☆238 · Nov 14, 2020 · Updated 5 years ago
yc9701 / pansori
Tools for ASR Corpus Generation from Online Video
☆140 · Feb 10, 2019 · Updated 7 years ago
athena-team / DiDiSpeech
☆45 · Oct 24, 2020 · Updated 5 years ago
rishikksh20 / PPSpeech
PPSpeech: Phrase based Parallel End-to-End TTS System
☆35 · Aug 31, 2020 · Updated 5 years ago
geneing / WaveRNN-Pytorch
Fatcord's Alternative WaveRNN (Faster training)
☆132 · Nov 29, 2020 · Updated 5 years ago
Deepest-Project / AlignTTS
Implementation of the AlignTTS
☆77 · Jul 6, 2023 · Updated 2 years ago
tencent-ailab / pika
A lightweight speech processing toolkit based on PyTorch and (Py)Kaldi
☆344 · Dec 25, 2020 · Updated 5 years ago
andi611 / ZeroSpeech-TTS-without-T
A PyTorch implementation for the ZeroSpeech 2019 challenge.
☆112 · Nov 12, 2019 · Updated 6 years ago
ksw0306 / ClariNet
A PyTorch implementation of ClariNet
☆292 · Aug 5, 2019 · Updated 6 years ago
xcmyz / FastVocoder
Includes Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.
☆157 · Jul 2, 2021 · Updated 4 years ago
pettarin / forced-alignment-tools
A collection of links and notes on forced alignment tools
☆935 · Nov 10, 2021 · Updated 4 years ago
xiph / LPCNet
Efficient neural speech synthesis
☆1,203 · Sep 21, 2024 · Updated last year
tiberiu44 / TTS-Cube
End-2-end speech synthesis with recurrent neural networks
☆223 · Feb 24, 2024 · Updated 2 years ago
kan-bayashi / PytorchWaveNetVocoder
WaveNet-Vocoder implementation with PyTorch.
☆300 · Jun 8, 2020 · Updated 5 years ago
hirofumi0810 / neural_sp
End-to-end ASR/LM implementation with PyTorch
☆594 · Aug 30, 2021 · Updated 4 years ago
mkotha / WaveRNN
A WaveRNN implementation
☆201 · Oct 14, 2019 · Updated 6 years ago
liusongxiang / efficient_tts
PyTorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"
☆116 · Dec 22, 2021 · Updated 4 years ago
hhguo / MSMC-TTS
Official implementation of Multi-Stage Multi-Codebook (MSMC) TTS
☆168 · Apr 10, 2024 · Updated last year
kylebgorman / textgrid
A Python module for interacting with Praat TextGrid files. Also includes a class for reading HTK .mlf files into Praat
☆298 · Nov 8, 2023 · Updated 2 years ago
thu-spmi / CAT
A CRF-based ASR Toolkit
☆362 · Feb 5, 2026 · Updated 3 weeks ago