rolczynski / Automatic-Speech-Recognition
π§ Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
β224Updated 4 years ago
Alternatives and similar repositories for Automatic-Speech-Recognition:
Users that are interested in Automatic-Speech-Recognition are comparing it to the libraries listed below
- A Keras CTC implementation of Baidu's DeepSpeech for model experimentationβ242Updated 7 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languagesβ471Updated 5 years ago
- A list of publically available audio data that anyone can download for ASR or other speech activitiesβ207Updated 3 years ago
- A fully convolution-network for speech-to-text, built on pytorch.β126Updated 4 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.β376Updated last year
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )β537Updated 3 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.β170Updated 4 years ago
- Identifying people from small audio fragmentsβ170Updated 5 years ago
- ESPnet Model Zooβ249Updated last year
- Speaker diarization scripts, based on AaltoASRβ190Updated 6 years ago
- End-2-end speech synthesis with recurrent neural networksβ226Updated last year
- Problem Agnostic Speech Encoderβ440Updated last year
- DeepSpeech based forced alignment toolβ237Updated 4 years ago
- Working online speech recognition based on RNN Transducer. ( Trained model release available in release )β293Updated 3 years ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.β129Updated 4 years ago
- A neural attention model for speech command recognitionβ185Updated 2 years ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systemsβ205Updated 2 months ago
- End-to-end speech recognition using RNN Transducers in Tensorflow 2.0β244Updated 4 years ago
- DeepSpeech, Speech To Text, ASR, Speech recognition, Keras, Tensorflowβ30Updated 7 years ago
- Speech-to-text based on wav2letter built for transfer learningβ97Updated 2 years ago
- Voice Activity Detector in Pythonβ475Updated 4 years ago
- INTERSPEECH 2019 Tutorial Materialsβ193Updated 4 years ago
- A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"β679Updated last year
- PyTorch implementation of Tacotron speech synthesis model.β310Updated 5 years ago
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech procβ¦β365Updated 4 months ago
- End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.β122Updated 5 years ago
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networksβ439Updated 4 years ago
- Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.β72Updated 6 years ago
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"β367Updated 6 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videosβ154Updated 5 years ago