stefanpantic / asrLinks
Automatic speech recognition using neural networks
☆18Updated 4 years ago
Alternatives and similar repositories for asr
Users that are interested in asr are comparing it to the libraries listed below
Sorting:
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Updated 5 years ago
- Example implementation of Monotonic Chunkwise Attention.☆52Updated 7 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- This is the home directory to speaker diarization module being developed for Hetergeneous News data in RedHen Labs as a GSOC Project☆10Updated 9 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆45Updated 4 years ago
- Text-to-Speech tutorial at SLTU 2016☆35Updated 9 years ago
- An implementation of RNN-Transducer loss in TF-2.0.☆45Updated 2 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆41Updated 2 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15Updated 5 years ago
- readers that enable reading kaldi ark in tensorflow☆17Updated 7 years ago
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis☆23Updated 3 years ago
- Grapheme to phoneme model for PyTorch☆41Updated 2 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 11 months ago
- "An Improved Deep Embedding Learning Method for Short Duration Speaker Verification" pytorch implementation☆19Updated 6 years ago
- Download and preperation tool for free speech corpora.☆16Updated 6 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Updated 5 years ago
- ☆20Updated 5 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated last year
- A collection of basic python modules for spoken natural language processing☆56Updated 5 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆22Updated 6 years ago
- Kaldi extended by Kaituo XU with new features in nnet1.☆12Updated 6 years ago
- An open-source tool for automatic speech recognition ASR quality estimation.☆23Updated 5 years ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆35Updated 3 years ago
- Losses and decoders for end-to-end ASR and OCR☆34Updated 4 years ago
- A Kaldi/ESPnet based approach to perform automatic speech recognition on low resource languages☆9Updated 4 years ago
- ☆24Updated 6 years ago
- A neural language modeling toolkit built on PyTorch☆18Updated 2 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆31Updated 3 years ago
- ABX and kaldi experiments on speech corpora made easy☆32Updated 9 months ago
- This is now the official location of the Kaldi project.☆13Updated 6 years ago