rolczynski / Automatic-Speech-Recognition
π§ Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
β223Updated 4 years ago
Alternatives and similar repositories for Automatic-Speech-Recognition:
Users that are interested in Automatic-Speech-Recognition are comparing it to the libraries listed below
- A Keras CTC implementation of Baidu's DeepSpeech for model experimentationβ241Updated 6 years ago
- A list of publically available audio data that anyone can download for ASR or other speech activitiesβ202Updated 3 years ago
- A neural attention model for speech command recognitionβ183Updated last year
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languagesβ468Updated 4 years ago
- A fully convolution-network for speech-to-text, built on pytorch.β126Updated 4 years ago
- DeepSpeech based forced alignment toolβ235Updated 4 years ago
- Speech-to-text based on wav2letter built for transfer learningβ97Updated 2 years ago
- End-to-end speech recognition using RNN Transducers in Tensorflow 2.0β243Updated 3 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.β170Updated 3 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.β375Updated last year
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech procβ¦β366Updated last month
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.β580Updated 2 years ago
- Speaker diarization scripts, based on AaltoASRβ190Updated 6 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )β535Updated 2 years ago
- ASR with PyTorchβ140Updated 5 years ago
- Segment an audio file and obtain utterance alignments. (Python package)β325Updated 8 months ago
- INTERSPEECH 2019 Tutorial Materialsβ193Updated 3 years ago
- DeepSpeech, Speech To Text, ASR, Speech recognition, Keras, Tensorflowβ30Updated 7 years ago
- Identifying people from small audio fragmentsβ170Updated 4 years ago
- Deep Neural Network for Speaker Count Estimationβ146Updated 4 years ago
- Voice Activity Detection (VAD) using deep learning.β193Updated 5 years ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speechβ228Updated 2 years ago
- Open tools and data for cloudless automatic speech recognitionβ446Updated 3 years ago
- [deprecated] Pretrained models for pyannote-audio 1.xβ71Updated 2 years ago
- Working online speech recognition based on RNN Transducer. ( Trained model release available in release )β291Updated 3 years ago
- PyTorch implementations of neural network models for keyword spottingβ515Updated last year
- A pure python module for reading and writing kaldi ark filesβ252Updated last year
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196β308Updated 4 years ago
- Problem Agnostic Speech Encoderβ440Updated last year
- Automatically constructing corpus for automatic speech recognition from YouTube videosβ153Updated 4 years ago