hongwen-sun/speech-aligner

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hongwen-sun/speech-aligner)

hongwen-sun / speech-aligner

speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription

☆15

Alternatives and similar repositories for speech-aligner

Users that are interested in speech-aligner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kate-egorova / ASR-hybrid-decoding
View on GitHub
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11Feb 4, 2020Updated 6 years ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
BUTSpeechFIT / ASR-hybrid-decoding
View on GitHub
☆17Nov 25, 2019Updated 6 years ago
TeaPoly / warp-ctc-crf
View on GitHub
An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.
☆12Jul 5, 2021Updated 5 years ago
idnavid / speech_activity_detection
View on GitHub
Unsupervised speech activity detection system.
☆11Jul 2, 2018Updated 8 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
dense-analysis / vim-speech
View on GitHub
Vim Speech Recognition Experiments
☆20May 30, 2025Updated last year
Many0therFunctions / MaskGCT-Text-To-Semantic-Finetune
View on GitHub
This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …
☆13Dec 4, 2024Updated last year
wangkenpu / Adaptation-Interspeech18
View on GitHub
Empirical Evaluation of Speaker Adaptation on DNN based Acoustic Model
☆13Nov 25, 2019Updated 6 years ago
csukuangfj / kaldilm
View on GitHub
Python wrapper for kaldi's arpa2fst
☆38Aug 27, 2025Updated 11 months ago
IU-SAIGE / pse
View on GitHub
Efficient Personalized Speech Enhancement through Self-Supervised Learning
☆23Mar 12, 2023Updated 3 years ago
shtoshni / g2p
View on GitHub
Code for SLT 2016 paper on Grapheme-to-Phoneme conversion using attention based encoder-decoder models
☆15Feb 20, 2019Updated 7 years ago
seungheondoh / hi_kia
View on GitHub
wake-up word emotion recognition [APSIPA 2022]
☆17Nov 11, 2022Updated 3 years ago
irebai / SpecAugment_KALDI
View on GitHub
A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆15Sep 4, 2019Updated 6 years ago
SpeechColab / PySpeechColab
View on GitHub
A library of speech gadgets.
☆15Oct 15, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
mbzuai-nlp / sttatts
View on GitHub
☆31Oct 29, 2024Updated last year
open-speech / speech-aligner
View on GitHub
speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…
☆410Apr 8, 2020Updated 6 years ago
sh01k / imp_tsp
View on GitHub
Measuring impulse response with time-stretched pulse (TSP) signal
☆14Jul 3, 2019Updated 7 years ago
JarbasAl / kaldi_spotter
View on GitHub
wake word spotting with kaldi
☆19Dec 3, 2020Updated 5 years ago
Riroaki / Chinese-Rhythm-Predictor
View on GitHub
基于随机森林和条件随机场的中文韵律预测模型
☆28Jul 25, 2024Updated 2 years ago
kakaobrain / g2pm
View on GitHub
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
☆367Dec 24, 2021Updated 4 years ago
ffxiong / uaspeech
View on GitHub
Baseline kaldi script for UA-SPEECH corpus
☆32Oct 16, 2024Updated last year
yuboona / some-script-to-help-using-Montreal-Forced-Aligner
View on GitHub
Some script for helping using Montreal Forced Aligner, maily for transforming Hanzi character to pinyin and extrat pause time from .textg…
☆14Feb 9, 2024Updated 2 years ago
revsic / torch-retriever-vc
View on GitHub
PyTorch implementation of Retriever: Learning Content-Style Representation
☆12Jan 27, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
speechpro / mixup
View on GitHub
☆24Mar 13, 2020Updated 6 years ago
athena-team / athena-decoder
View on GitHub
☆76Mar 18, 2022Updated 4 years ago
guanlongzhao / ppg-gmm
View on GitHub
Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"
☆36Jan 15, 2020Updated 6 years ago
fgnt / LatticeWordSegmentation
View on GitHub
Software to apply unsupervised word segmentation on lattices or text sequences using a nested hierarchical Pitman Yor language model
☆17Nov 24, 2016Updated 9 years ago
IoSR-Surrey / RealRoomBRIRs
View on GitHub
Binaural impulse responses captured in real rooms.
☆41Mar 9, 2016Updated 10 years ago
robin1001 / nn-vad
View on GitHub
simple dnn based vad
☆69Dec 2, 2018Updated 7 years ago
NeuroWave-ai / CUCVAE-TTS
View on GitHub
☆25Mar 12, 2022Updated 4 years ago
Yangyangii / TPGST-Tacotron
View on GitHub
Google's TPGST reimplementation.
☆34Dec 11, 2019Updated 6 years ago
groadabike / Kaldi-Dsing-task
View on GitHub
DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.
☆19Jul 9, 2026Updated 2 weeks ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
idiap / inv-tn
View on GitHub
A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)
☆21Sep 27, 2017Updated 8 years ago
ag1988 / mel-asr
View on GitHub
The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George…
☆21Oct 11, 2024Updated last year
datemoon / tf-code-acoustics
View on GitHub
it's a train acoustics model code lib
☆27May 20, 2020Updated 6 years ago
desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
lifeiteng / NaturalSpeech2
View on GitHub
☆33Jun 29, 2023Updated 3 years ago
zzw922cn / LPC_for_TTS
View on GitHub
Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.
☆72Mar 19, 2021Updated 5 years ago
tzyll / ChineseHP
View on GitHub
Dataset for Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models in Interspeech 2024.
☆16Jul 4, 2024Updated 2 years ago