nassosoassos / sail_align
SailAlign is an open-source software toolkit for robust long speech-text alignment. It implements an adaptive, iterative speech recognition and text alignment scheme that can process very long (and possibly noisy) audio and is robust to transcription errors. It is written mainly as a Perl library, but its functionality also depends…
☆97Updated 2 years ago
Related projects
Alternatives and complementary repositories for sail_align
- A script for audio/transcript alignment. Fork of p2fa.☆69Updated 6 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 7 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆78Updated 5 years ago
- Automatic prosodic annotation tool written in Java.☆57Updated 5 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆81Updated 6 months ago
- Adapting your own Language Model for Kaldi☆64Updated 5 years ago
- ☆57Updated 5 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆62Updated 4 years ago
- A Python package that lets TensorFlow read Kaldi scp/ark files in an elegant way. May Kaldi users be happy to enter the TensorFlow world.☆40Updated 5 years ago
- Scripts to align a given wave file to its transcription using models trained with Kaldi☆32Updated 5 years ago
- A Collection of Speech Corpus for ASR and TTS☆112Updated 7 years ago
- ☆65Updated 10 years ago
- This repository is now obsolete. Please go to https://github.com/idlak/idlak instead.☆39Updated 6 years ago
- A repository for maintaining the fave-align and fave-extract toolkits☆115Updated 7 months ago
- Python wrapper for Kaldi decoders (Kaldi https://sourceforge.net/projects/kaldi/)☆80Updated 8 years ago
- Online decoder for Kaldi NNET2 and GMM speech recognition models with Python bindings.☆49Updated 7 years ago
- ☆26Updated 7 years ago
- This is now the official location of the Kaldi project.☆13Updated 5 years ago
- A high-level toolkit for speaker recognition, built on top of ALIZE-Core.☆125Updated 5 years ago
- 24-hour Automatic Speech Recognition☆27Updated 3 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated last year
- Implementation of audio degradation processes☆101Updated 9 years ago
- Long audio alignment using Kaldi☆25Updated 3 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.☆15Updated 4 years ago
- Python interface for forced audio alignment using HTK and SoX☆331Updated 4 years ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆54Updated 4 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆44Updated 4 years ago
- Server framework for Kaldi ASR Toolkit☆97Updated last year
- ☆40Updated 2 years ago
- A Praat plug-in for performing interactive phonetic forced alignment☆26Updated 6 years ago