rolczynski / Automatic-Speech-RecognitionLinks
π§ Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
β224Updated 4 years ago
Alternatives and similar repositories for Automatic-Speech-Recognition
Users that are interested in Automatic-Speech-Recognition are comparing it to the libraries listed below
Sorting:
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )β535Updated 3 years ago
- A list of publically available audio data that anyone can download for ASR or other speech activitiesβ209Updated 3 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.β376Updated last year
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languagesβ472Updated 5 years ago
- A Keras CTC implementation of Baidu's DeepSpeech for model experimentationβ242Updated 7 years ago
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech procβ¦β368Updated last week
- DeepSpeech based forced alignment toolβ237Updated 4 years ago
- End-to-end speech recognition using RNN Transducers in Tensorflow 2.0β245Updated 4 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.β170Updated 4 years ago
- Deep neural networks for getting text-independent speaker embedding written in TensorFlowβ310Updated 6 years ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.β129Updated 4 years ago
- Open tools and data for cloudless automatic speech recognitionβ446Updated 4 years ago
- Paper: https://arxiv.org/abs/1702.02285β64Updated 6 years ago
- A neural attention model for speech command recognitionβ185Updated 2 years ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.β586Updated 3 years ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systemsβ210Updated 3 months ago
- Utterance-level Aggregation For Speaker Recognition In The Wildβ368Updated 2 years ago
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"β367Updated 6 years ago
- ASR with PyTorchβ139Updated 6 years ago
- End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)β314Updated 7 years ago
- Python library for handling audio datasets.β138Updated last year
- Word alignments generated by the Montreal Forced Aligner for the Librispeech datasetβ163Updated 6 years ago
- End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.β122Updated 5 years ago
- Tensorflow implementation of "Generalized End-to-End Loss for Speaker Verification"β368Updated 3 years ago
- Identifying people from small audio fragmentsβ170Updated 5 years ago
- VCTK multi-speaker tacotron for ICASSP 2020β266Updated 3 years ago
- A fully convolution-network for speech-to-text, built on pytorch.β126Updated 5 years ago
- End-2-end speech synthesis with recurrent neural networksβ226Updated last year
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognitionβ484Updated 3 years ago
- An opensource speech-to-text software written in tensorflowβ158Updated 2 years ago