yao-matrix/deepSpeech2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yao-matrix/deepSpeech2)

yao-matrix / deepSpeech2

End-to-end speech recognition using TensorFlow

☆48

Alternatives and similar repositories for deepSpeech2

Users that are interested in deepSpeech2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

srinivr / kaldi-long-audio-alignment
View on GitHub
Long audio alignment using Kaldi
☆23Apr 22, 2021Updated 5 years ago
yh1008 / speech-to-text
View on GitHub
mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
☆71Nov 20, 2017Updated 8 years ago
SeanNaren / deepspeech.pytorch
View on GitHub
Speech Recognition using DeepSpeech2.
☆2,136Dec 13, 2022Updated 3 years ago
chenzhehuai / kaldi
View on GitHub
This is now the official location of the Kaldi project.
☆24Nov 13, 2019Updated 6 years ago
hackerlibs / rag-code-sorting-search
View on GitHub
RAG code sorting search, RAG knowledge organization
☆16Nov 22, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
oxinabox / Kaldi-Notes
View on GitHub
Some notes on Kaldi
☆32Feb 20, 2015Updated 11 years ago
wangxiao5791509 / Age-Progression-Regression-by-CAAE
View on GitHub
the repaired code of paper "Age Progression/Regression by Conditional Adversarial Autoencoder---CVPR 2017"
☆10Sep 19, 2017Updated 8 years ago
andrewcsmith / tf_infinite_ramble
View on GitHub
the infinite ramble in rust, powered by tensorflow. (mfcc cosine similarity matching)
☆13Apr 30, 2018Updated 8 years ago
k2-fsa / kaldi-decoder
View on GitHub
Decoders from Kaldi using OpenFst
☆35Apr 10, 2026Updated 3 months ago
mahimg / Speaker-recognition
View on GitHub
Segment speech sequences based on speaker transitions, using ML and DSP.
☆17Jul 30, 2018Updated 7 years ago
tli725 / JL-Corpus
View on GitHub
For further understanding the wide array of emotions embedded in human speech, we are introducing an emotional speech corpus. In contrast…
☆11Oct 29, 2018Updated 7 years ago
Zhengyu-Li / Deep-Network-Compression-based-on-Student-Teacher-Network-
View on GitHub
Deep Neural Network Compression based on Student-Teacher Network
☆14Jul 6, 2023Updated 3 years ago
mozilla / voice-corpus-tool
View on GitHub
Tool for creation, manipulation and maintenance of voice corpora
☆82May 3, 2024Updated 2 years ago
reith / deepspeech-playground
View on GitHub
Baidu's DeepSpeech updated for better training
☆23Sep 5, 2018Updated 7 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
shitian-ni / speech-recognition-transfer-learning
View on GitHub
Speech command recognition DenseNet transfer learning from UrbanSound8k in keras tensorflow
☆17Jan 19, 2018Updated 8 years ago
IBM / audioset-classification
View on GitHub
Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning
☆102Sep 17, 2025Updated 10 months ago
NervanaSystems / deepspeech
View on GitHub
DeepSpeech neon implementation
☆220Jan 3, 2023Updated 3 years ago
distillpub / post--ctc
View on GitHub
Sequence Modelling with CTC
☆52Dec 29, 2022Updated 3 years ago
adanRivas / CNN-Audio-Classifier-with-Keras-Tensorflow
View on GitHub
ipython notebooks for feature extraction and training of audio event classifier on ESC-50 dataset.
☆10Mar 16, 2018Updated 8 years ago
hongfeixue / StutteringSpeechChallenge
View on GitHub
SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge
☆12Jun 11, 2024Updated 2 years ago
gooofy / py-kaldi-asr
View on GitHub
Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
☆169Feb 23, 2021Updated 5 years ago
tiefenauer / wiki-lm
View on GitHub
Script to train a German n-gram Language Model on articles of Wikipedia
☆14Oct 20, 2018Updated 7 years ago
rosinality / melgan-pytorch
View on GitHub
MelGAN and Tacotron 2 in PyTorch
☆11Oct 22, 2019Updated 6 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
ShankHarinath / DeepSpeech2-Keras
View on GitHub
DeepSpeech, Speech To Text, ASR, Speech recognition, Keras, Tensorflow
☆30Jan 16, 2018Updated 8 years ago
JRMeyer / common-voice-stats
View on GitHub
A living document for all things Common Voice.
☆14Jun 24, 2024Updated 2 years ago
csukuangfj / kaldi-hmm-gmm
View on GitHub
☆28Apr 24, 2026Updated 3 months ago
KrishnaDN / BERTphone
View on GitHub
Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"
☆17Dec 10, 2020Updated 5 years ago
corticph / MSTmodel
View on GitHub
Code for https://arxiv.org/abs/1712.00254
☆18Dec 6, 2017Updated 8 years ago
JaesungBae / Speech-Command-Recognition-with-Capsule-Network
View on GitHub
Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.
☆25Jan 28, 2019Updated 7 years ago
Trangle / mxnet-inception-v4
View on GitHub
☆23Aug 24, 2016Updated 9 years ago
Braden1996 / Audio-Synthesiser
View on GitHub
A fourier-based audio-synthesiser wrote in MATLAB as a university project.
☆11Jan 19, 2019Updated 7 years ago
easonnie / ResEncoder
View on GitHub
This repo is for residual-connected sentence encoder for NLI.
☆11Jan 21, 2018Updated 8 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
sleepinyourhat / quora-duplicate-questions-util
View on GitHub
Converts Quora's new NLU dataset to SNLI txt/jsonl format, plus test/dev split, tokenization.
☆14Jan 27, 2017Updated 9 years ago
RicherMans / Dcase2018_pooling
View on GitHub
Repo for our pooling approach on the DCASE2018 task4
☆16Jul 6, 2023Updated 3 years ago
jfainberg / sincnet_adapt
View on GitHub
Raw waveform adaptation with SincNet
☆12Mar 19, 2024Updated 2 years ago
sunny8898 / DeepSpeech-tensorflow
View on GitHub
将百度DeepSpeech的keras后端由theano改为tensorflow，整合mozilla解码模块进行中文语音识别模型部署
☆10Dec 2, 2019Updated 6 years ago
aws-samples / blue-green-deployment-pipeline-for-efs
View on GitHub
Blue/Green deployment with AWS Developer tools on Amazon EC2 using Amazon EFS to host application source code
☆11Jul 27, 2021Updated 4 years ago
matthewzhou / Nerve-Segmentation
View on GitHub
Image recognition for nerves from ultrasound images using a sliding window CNN
☆10Aug 15, 2016Updated 9 years ago
SamPosh / PyDevbox
View on GitHub
Devbox: Prepare your python development environment -Neovim with kickstarter.nvim
☆16May 28, 2023Updated 3 years ago