stefanpantic / asrLinks

Automatic speech recognition using neural networks

☆18

Alternatives and similar repositories for asr

Users that are interested in asr are comparing it to the libraries listed below

Sorting:

kate-egorova / ASR-hybrid-decoding
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11Updated 5 years ago
craffel / mocha
Example implementation of Monotonic Chunkwise Attention.
☆52Updated 7 years ago
kaituoxu / Tacotron2
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…
☆52Updated 6 years ago
ksingla025 / Speaker_Dia_RedHen
This is the home directory to speaker diarization module being developed for Hetergeneous News data in RedHen Labs as a GSOC Project
☆10Updated 9 years ago
AppleHolic / audioset_augmentor
Sound augmentation using Large-scale audio dataset (Audioset)
☆45Updated 4 years ago
mjansche / tts-tutorial
Text-to-Speech tutorial at SLTU 2016
☆35Updated 9 years ago
iamjanvijay / rnnt
An implementation of RNN-Transducer loss in TF-2.0.
☆45Updated 2 years ago
Idlak / Living-Audio-Dataset
A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …
☆41Updated 2 years ago
CoEDL / kaldi_helpers
A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
☆15Updated 5 years ago
t13m / kaldi-readers-for-tensorflow
readers that enable reading kaldi ark in tensorflow
☆17Updated 7 years ago
idiap / IdiapTTS
A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis
☆23Updated 3 years ago
CiscoDevNet / g2p_seq2seq_pytorch
Grapheme to phoneme model for PyTorch
☆41Updated 2 years ago
m-wiesner / nnet_pytorch
Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.
☆26Updated 11 months ago
qqueing / speaker_embedding-pytorch
"An Improved Deep Embedding Learning Method for Short Duration Speaker Verification" pytorch implementation
☆19Updated 6 years ago
mdangschat / speech-corpus-dl
Download and preperation tool for free speech corpora.
☆16Updated 6 years ago
cyfer0618 / kaldi-pytorch-rnnlm
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Updated 5 years ago
athena-team / athena-transform
☆20Updated 5 years ago
joaoantoniocn / AM-MobileNet1D
The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…
☆30Updated last year
gooofy / py-nltools
A collection of basic python modules for spoken natural language processing
☆56Updated 5 years ago
bsxfan / meta-embeddings
Meta-embeddings are a probabilistic generalization of embeddings in machine learning.
☆22Updated 6 years ago
kaituoxu / kaldi-ktnet1
Kaldi extended by Kaituo XU with new features in nnet1.
☆12Updated 6 years ago
hlt-mt / TranscRater
An open-source tool for automatic speech recognition ASR quality estimation.
☆23Updated 5 years ago
uhh-lt / kaldi-model-server
Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone
☆35Updated 3 years ago
artbataev / end2end
Losses and decoders for end-to-end ASR and OCR
☆34Updated 4 years ago
vinayak19th / ASR-Low-Resource
A Kaldi/ESPnet based approach to perform automatic speech recognition on low resource languages
☆9Updated 4 years ago
OrcusCZ / NNAcousticModeling
☆24Updated 6 years ago
BUTSpeechFIT / BrnoLM
A neural language modeling toolkit built on PyTorch
☆18Updated 2 years ago
VITA-Group / Audio-Lottery
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆31Updated 3 years ago
bootphon / abkhazia
ABX and kaldi experiments on speech corpora made easy
☆32Updated 9 months ago
mmaciej2 / kaldi
This is now the official location of the Kaldi project.
☆13Updated 6 years ago