rolczynski / Automatic-Speech-Recognition
π§ Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
β224Updated 4 years ago
Alternatives and similar repositories for Automatic-Speech-Recognition
Users that are interested in Automatic-Speech-Recognition are comparing it to the libraries listed below
Sorting:
- A list of publically available audio data that anyone can download for ASR or other speech activitiesβ209Updated 3 years ago
- A Keras CTC implementation of Baidu's DeepSpeech for model experimentationβ242Updated 7 years ago
- End-to-end speech recognition using RNN Transducers in Tensorflow 2.0β245Updated 4 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.β376Updated last year
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )β537Updated 3 years ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systemsβ206Updated 2 months ago
- A neural attention model for speech command recognitionβ185Updated 2 years ago
- DeepSpeech based forced alignment toolβ237Updated 4 years ago
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech procβ¦β365Updated 5 months ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.β584Updated 3 years ago
- Speaker diarization scripts, based on AaltoASRβ190Updated 6 years ago
- Identifying people from small audio fragmentsβ170Updated 5 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videosβ154Updated 5 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.β170Updated 4 years ago
- ASR with PyTorchβ139Updated 6 years ago
- Open tools and data for cloudless automatic speech recognitionβ447Updated 4 years ago
- Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).β130Updated 4 years ago
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brainβ648Updated 3 years ago
- Implementation of all-neural speech recognition systems using Keras and Tensorflowβ144Updated 7 years ago
- Problem Agnostic Speech Encoderβ440Updated last year
- Deep neural networks for getting text-independent speaker embedding written in TensorFlowβ309Updated 6 years ago
- Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.β72Updated 6 years ago
- A fully convolution-network for speech-to-text, built on pytorch.β126Updated 4 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languagesβ470Updated 5 years ago
- End-2-end speech synthesis with recurrent neural networksβ226Updated last year
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorchβ211Updated 4 years ago
- End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.β122Updated 5 years ago
- Speech-to-text based on wav2letter built for transfer learningβ97Updated 2 years ago
- End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)β314Updated 7 years ago
- Voice Activity Detection (VAD) using deep learning.β196Updated 5 years ago