rolczynski / Automatic-Speech-Recognition
π§ Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
β224Updated 4 years ago
Alternatives and similar repositories for Automatic-Speech-Recognition:
Users that are interested in Automatic-Speech-Recognition are comparing it to the libraries listed below
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )β535Updated 3 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.β376Updated last year
- A list of publically available audio data that anyone can download for ASR or other speech activitiesβ203Updated 3 years ago
- End-to-end speech recognition using RNN Transducers in Tensorflow 2.0β243Updated 4 years ago
- DeepSpeech based forced alignment toolβ237Updated 4 years ago
- A Keras CTC implementation of Baidu's DeepSpeech for model experimentationβ241Updated 6 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196β310Updated 4 years ago
- A neural attention model for speech command recognitionβ183Updated last year
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languagesβ467Updated 4 years ago
- Voice Activity Detection based on Deep Learning & TensorFlowβ358Updated last year
- A fully convolution-network for speech-to-text, built on pytorch.β126Updated 4 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.β65Updated 5 years ago
- Speech Recognition model based off of FAIR research paper built using Pytorch.β83Updated 6 years ago
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brainβ646Updated 2 years ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.β582Updated 3 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.β170Updated 3 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videosβ154Updated 5 years ago
- Open tools and data for cloudless automatic speech recognitionβ447Updated 3 years ago
- A pure python module for reading and writing kaldi ark filesβ252Updated last year
- ASR with PyTorchβ140Updated 5 years ago
- PyTorch implementations of neural network models for keyword spottingβ514Updated last year
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"β368Updated 6 years ago
- End-2-end speech synthesis with recurrent neural networksβ226Updated 11 months ago
- Keras implementation of ββDeep Speaker: an End-to-End Neural Speaker Embedding Systemββ (speaker recognition)β248Updated 4 years ago
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech procβ¦β366Updated 2 months ago
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networksβ434Updated 4 years ago
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognitionβ478Updated 3 years ago
- Share some recent speaker recognition papers and their implementations.β90Updated 5 years ago
- Paper: https://arxiv.org/abs/1702.02285β63Updated 6 years ago
- Speaker diarization scripts, based on AaltoASRβ190Updated 6 years ago