nassosoassos/sail_align

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nassosoassos/sail_align)

nassosoassos / sail_align

SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition and text alignment scheme that allows for the processing of very long (and possibly noisy) audio and is robust to transcription errors. It is mainly written as a perl library but its functionality also depends…

☆99

Alternatives and similar repositories for sail_align

Users that are interested in sail_align are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

prosodylab / Prosodylab-Aligner
View on GitHub
Python interface for forced audio alignment using HTK and SoX
☆351Jun 28, 2020Updated 6 years ago
georgepar / kaldi-grpc-server
View on GitHub
Deploy Kaldi models using grpc for bidirectional streaming.
☆17Sep 30, 2024Updated last year
dansoutner / kaldi2htk
View on GitHub
Script for converting kaldi GMM/HMM models to HTK format
☆11Jul 18, 2024Updated 2 years ago
srinivr / kaldi-long-audio-alignment
View on GitHub
Long audio alignment using Kaldi
☆23Apr 22, 2021Updated 5 years ago
MattShannon / htk_io
View on GitHub
Read and write HTK and HTS files from python.
☆20Mar 17, 2015Updated 11 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
georgepar / slp
View on GitHub
Utils and modules for Speech Language and Multimodal processing using pytorch and pytorch lightning
☆22Feb 16, 2023Updated 3 years ago
strob / gentle
View on GitHub
gentle forced aligner
☆1,704Updated this week
i3thuan5 / hts_engine_python
View on GitHub
python wrap for hts engine
☆14Jan 30, 2018Updated 8 years ago
tyiannak / pyTextClassification
View on GitHub
Training and using classifiers for textual documents
☆15Sep 16, 2016Updated 9 years ago
ondrejklejch / acoustic_punctuation
View on GitHub
NMT based punctuation prediction system using lexical and acoustic features .
☆14Mar 30, 2020Updated 6 years ago
tyiannak / AUROS
View on GitHub
A ROS framework for Audio Analysis
☆12Apr 5, 2017Updated 9 years ago
pettarin / forced-alignment-tools
View on GitHub
A collection of links and notes on forced alignment tools
☆942Jul 22, 2026Updated last week
igormq / ctcdecode-pytorch
View on GitHub
Python implementation of CTC beam search decoder + agnostic LM scorer
☆20Dec 16, 2020Updated 5 years ago
alokprasad / lpctron-tts-cpp
View on GitHub
C++ implementation of End to End TTS which combines both Tacatron2 and LPCNET Vocoder.
☆32Oct 1, 2019Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
vadimkantorov / inferspeech
View on GitHub
PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant
☆10Aug 12, 2019Updated 6 years ago
ucbvislab / p2fa-vislab
View on GitHub
A script for audio/transcript alignment. Fork of p2fa.
☆69Mar 15, 2018Updated 8 years ago
MiniXC / phones
View on GitHub
A collection of utilities for handling IPA phones.
☆27Sep 24, 2023Updated 2 years ago
DistantSpeechRecognition / sweethomelisten
View on GitHub
☆17Apr 8, 2016Updated 10 years ago
jingyonghou / KWS_Max-pooling_RHE
View on GitHub
Mining effective negative training samples for keyword spotting (PyTorch)
☆66May 23, 2020Updated 6 years ago
TalnUPF / praat_web
View on GitHub
☆13Jun 30, 2026Updated last month
nsmartinez / WERpp
View on GitHub
Calculates the Word Error Rate between two text files
☆20Nov 10, 2022Updated 3 years ago
uasolo / FDA-DH
View on GitHub
R Code recipes for Functional Data Analysis for phonetic analysis.
☆13Jul 31, 2024Updated last year
athena-team / DiDiSpeech
View on GitHub
☆45Oct 24, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
JoFrhwld / FAVE
View on GitHub
A repository for maintaing the fave-align and fave-extract toolkits
☆118Mar 29, 2024Updated 2 years ago
soskuthy / gamm_strategies
View on GitHub
Supplementary materials for "Evaluating generalised additive mixed modelling strategies for dynamic speech analysis"
☆10Jan 25, 2021Updated 5 years ago
dialogflow / asr-server
View on GitHub
FastCGI support for Kaldi ASR
☆185Apr 5, 2019Updated 7 years ago
xflr6 / features
View on GitHub
Feature set algebra for linguistics
☆17Jul 7, 2026Updated 3 weeks ago
mravanelli / pytorch_MLP_for_ASR
View on GitHub
This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…
☆40Feb 10, 2018Updated 8 years ago
mjansche / tts-tutorial
View on GitHub
Text-to-Speech tutorial at SLTU 2016
☆35May 10, 2016Updated 10 years ago
alexnorton / transcript-model
View on GitHub
JSON schema and JavaScript model classes for dealing with time-aligned transcripts of speech.
☆16Aug 20, 2018Updated 7 years ago
OrcusCZ / NNAcousticModeling
View on GitHub
☆24Sep 25, 2018Updated 7 years ago
ottokart / punctuator
View on GitHub
An LSTM RNN for restoring missing punctuation in unsegmented text.
☆78Sep 24, 2016Updated 9 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ChristopherCarignan / audio2stl
View on GitHub
Converts an audio file to a 3D spectrogram and (optionally) saves as a stereolithography (STL) file for 3D printing
☆22Oct 31, 2021Updated 4 years ago
bajibabu / make_full_labels
View on GitHub
how to generate the full-contextual labels from un-seen text for the application of HMM-based speech synthesis (HTS)
☆12Nov 22, 2019Updated 6 years ago
AswinKumar1 / Forced-Alignment
View on GitHub
GSoC'16 RedHen Labs
☆11Aug 22, 2016Updated 9 years ago
rhasspy / ipa2kaldi
View on GitHub
Tool for creating Kaldi nnet3 recipes using the International Phonetic Alphabet (IPA)
☆10Jun 2, 2021Updated 5 years ago
rupakvignesh / Lyrics-to-Audio-Alignment
View on GitHub
Aligns text (lyrics) with monophonic singing voice (audio). The algorithm uses structural segmentation to segment the audio into structur…
☆94Feb 13, 2018Updated 8 years ago
kastnerkyle / ez-phones
View on GitHub
Wrapper to pocketsphinx phoneme labeling tools
☆18Sep 9, 2016Updated 9 years ago
MattShannon / HTS-demo_CMU-ARCTIC-SLT-STRAIGHT-AR-decision-tree
View on GitHub
Autoregressive HMM version of the HTS demo for statistical speech synthesis (includes autoregressive clustering)
☆16Sep 12, 2014Updated 11 years ago