nassosoassos / sail_align
SailAlign is an open-source software toolkit for robust long speech-text alignment. It implements an adaptive, iterative speech recognition and text alignment scheme that can process very long (and possibly noisy) audio and is robust to transcription errors. It is mainly written as a Perl library but its functionality also depends…
☆98 Updated 2 years ago
Alternatives and similar repositories for sail_align:
Users who are interested in sail_align are comparing it to the libraries listed below.
- A script for audio/transcript alignment. Fork of p2fa. ☆68 Updated 7 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications. ☆80 Updated 5 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise. ☆72 Updated 7 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks ☆64 Updated 4 years ago
- Tool for creation, manipulation and maintenance of voice corpora ☆81 Updated 10 months ago
- This repository is now obsolete. Please go to https://github.com/idlak/idlak instead. ☆39 Updated 7 years ago
- Automatic prosodic annotation tool written in Java. ☆60 Updated 5 years ago
- Adapting your own Language Model for Kaldi ☆64 Updated 6 years ago
- ☆58 Updated 5 years ago
- DeepSpeech based forced alignment tool ☆237 Updated 4 years ago
- Scripts for LIUM SpkDiarization tools ☆31 Updated 7 years ago
- Phoneme Recognition using RecNet ☆97 Updated 8 years ago
- Multilingual Grapheme to Phoneme ☆49 Updated 9 years ago
- A Collection of Speech Corpora for ASR and TTS ☆113 Updated 7 years ago
- Collection of machine learning demos for Automatic Speech Recognition ☆55 Updated 3 years ago
- ☆65 Updated 11 years ago
- Voxceleb1 i-vector based speaker recognition system ☆43 Updated 6 years ago
- Implementation of audio degradation processes ☆101 Updated 9 years ago
- Cross-lingual Voice Conversion ☆97 Updated 7 years ago
- Python interface for forced audio alignment using HTK and SoX ☆335 Updated 4 years ago
- A Python package that lets TensorFlow read Kaldi scp/ark files in an elegant way. May Kaldi users be happy to enter the TensorFlow world. ☆40 Updated 6 years ago
- Tensorflow Implementation of Expressive Tacotron ☆197 Updated 6 years ago
- Scripts to align a given waveform to its transcription using models trained with Kaldi ☆32 Updated 5 years ago
- Speaker diarization scripts, based on AaltoASR ☆190 Updated 6 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py… ☆33 Updated 6 years ago
- This is now the official location of the Kaldi project. ☆13 Updated 5 years ago
- This is a speech analysis, modification and synthesis system ☆51 Updated 3 years ago
- ☆25 Updated 7 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts ☆60 Updated 2 years ago
- Vocode spectrograms to audio with generative adversarial networks ☆63 Updated 5 years ago