nassosoassos / sail_alignLinks
SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition and text alignment scheme that allows for the processing of very long (and possibly noisy) audio and is robust to transcription errors. It is mainly written as a perl library but its functionality also depends…
☆98Updated 3 years ago
Alternatives and similar repositories for sail_align
Users that are interested in sail_align are comparing it to the libraries listed below
Sorting:
- A script for audio/transcript alignment. Fork of p2fa.☆68Updated 7 years ago
- Phoneme Recognition using RecNet☆97Updated 8 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 7 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆80Updated 5 years ago
- ☆58Updated 6 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆81Updated last year
- Long audio alignment using Kaldi☆23Updated 4 years ago
- Python interface for forced audio alignment using HTK and SoX☆341Updated 5 years ago
- A high-level toolkit for speaker recognition, build on top of ALIZE-Core.☆126Updated 6 years ago
- Adapting your own Language Model for Kaldi☆63Updated 6 years ago
- This is now the official location of the Kaldi project.☆13Updated 6 years ago
- A Collection of Speech Corpus for ASR and TTS☆114Updated 8 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆65Updated 5 years ago
- A repository for maintaing the fave-align and fave-extract toolkits☆117Updated last year
- scripts to align a given wave to its transcription using trained models by Kaldi☆32Updated 5 years ago
- automatic spoken language identification☆90Updated 6 years ago
- Cross-lingual Voice Conversion☆97Updated 7 years ago
- A python package that make tensorflow be able to read "Kaldi" scp/ark in an elegant way. May kaldi user happy to enter tensorflow world.☆40Updated 6 years ago
- Tensorflow Implementation of Expressive Tacotron☆196Updated 6 years ago
- Speaker diarization scripts, based on AaltoASR☆190Updated 6 years ago
- DeepSpeech based forced alignment tool☆238Updated 4 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.☆15Updated 5 years ago
- Multilingual Grapheme to Phoneme☆50Updated 9 years ago
- how to generate the full-contextual labels from un-seen text for the application of HMM-based speech synthesis (HTS)☆11Updated 5 years ago
- Online decoder for Kaldi NNET2 and GMM speech recognition models with Python bindings.☆49Updated 8 years ago
- A tool for automatic phoneme transcription☆157Updated 2 years ago
- HTK features in Python☆73Updated 6 years ago
- A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable wo…☆69Updated 7 years ago
- Speech Signal Processing - a small collection of routines in Python to do signal processing☆46Updated 6 years ago
- Collection of machine learning demos for Automatic Speech Recognition☆55Updated 3 years ago