nassosoassos / sail_alignLinks
SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition and text alignment scheme that allows for the processing of very long (and possibly noisy) audio and is robust to transcription errors. It is mainly written as a perl library but its functionality also depends…
☆98Updated 3 years ago
Alternatives and similar repositories for sail_align
Users that are interested in sail_align are comparing it to the libraries listed below
Sorting:
- A script for audio/transcript alignment. Fork of p2fa.☆69Updated 7 years ago
- ☆58Updated 6 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 8 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆80Updated 5 years ago
- Long audio alignment using Kaldi☆23Updated 4 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆82Updated last year
- Multilingual Grapheme to Phoneme☆50Updated 9 years ago
- A Collection of Speech Corpus for ASR and TTS☆114Updated 8 years ago
- Adapting your own Language Model for Kaldi☆63Updated 6 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Updated 5 years ago
- scripts to align a given wave to its transcription using trained models by Kaldi☆34Updated 6 years ago
- Phoneme Recognition using RecNet☆96Updated 8 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆65Updated 5 years ago
- Easier analysis of large speech corpora☆23Updated 4 years ago
- A repository for maintaing the fave-align and fave-extract toolkits☆118Updated last year
- Automatic prosodic annotation tool written in Java.☆64Updated 6 years ago
- A python package that make tensorflow be able to read "Kaldi" scp/ark in an elegant way. May kaldi user happy to enter tensorflow world.☆40Updated 6 years ago
- Cross-lingual Voice Conversion☆97Updated 7 years ago
- pronunciation dictionaries for multiple languages☆90Updated 8 years ago
- GSoC'16 RedHen Labs☆11Updated 9 years ago
- DeepSpeech based forced alignment tool☆239Updated 4 years ago
- Speaker diarization scripts, based on AaltoASR☆190Updated 6 years ago
- An HTML interface for finetuning the sync map output from aeneas☆53Updated 3 years ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆54Updated 5 years ago
- A tool for automatic phoneme transcription☆160Updated 2 years ago
- automatic spoken language identification☆90Updated 6 years ago
- Python interface for forced audio alignment using HTK and SoX☆345Updated 5 years ago
- Online decoder for Kaldi NNET2 and GMM speech recognition models with Python bindings.☆49Updated 8 years ago
- Deep understanding and modelling of the hierarchical structure of prosody☆23Updated 6 years ago
- A phoneme-allophone database for many languages☆52Updated 5 years ago