srinivr / kaldi-long-audio-alignmentLinks
Long audio alignment using Kaldi
☆23Updated 4 years ago
Alternatives and similar repositories for kaldi-long-audio-alignment
Users that are interested in kaldi-long-audio-alignment are comparing it to the libraries listed below
Sorting:
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Updated 5 years ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆54Updated 5 years ago
- create CMakeLists.txt for kaldi☆20Updated 5 years ago
- scripts to align a given wave to its transcription using trained models by Kaldi☆34Updated 6 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Updated last year
- A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆14Updated 6 years ago
- Text-to-Speech tutorial at SLTU 2016☆34Updated 9 years ago
- ☆48Updated 4 years ago
- Multilingual Grapheme to Phoneme☆50Updated 9 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42Updated 6 years ago
- An online speech recognition extension toolkit of Kaldi☆56Updated 4 years ago
- Custom decoders for Kaldi☆79Updated 6 years ago
- ☆20Updated 6 years ago
- ☆20Updated 5 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆61Updated 2 years ago
- ☆76Updated 3 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 8 years ago
- it's ASR decoder and make graph project☆33Updated 3 years ago
- ☆41Updated 7 years ago
- Filtering and Noise Adding Tool☆29Updated 3 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 5 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆22Updated 6 years ago
- Adapting your own Language Model for Kaldi☆63Updated 6 years ago
- APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence discriminative train…☆14Updated 4 years ago
- C++ implementation of End to End TTS which combines both Tacatron2 and LPCNET Vocoder.☆32Updated 5 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Updated 3 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Updated 5 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 4 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Updated 3 years ago
- Pulse Model vocoder☆42Updated 6 years ago