ZhangAustin/Deep-Speech

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ZhangAustin/Deep-Speech)

ZhangAustin / Deep-Speech

Deep Learning for Speech Recogntion based on Theano

☆15

Alternatives and similar repositories for Deep-Speech

Users that are interested in Deep-Speech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ifamille / WFST
View on GitHub
☆11Sep 16, 2014Updated 11 years ago
markusdr / transducersaurus
View on GitHub
Automatically exported from code.google.com/p/transducersaurus
☆11Apr 1, 2015Updated 11 years ago
jleni / lupa
View on GitHub
Lupa for Torch
☆10Sep 16, 2015Updated 10 years ago
syhw / timit_tools
View on GitHub
tools around preparing TIMIT for HMM (with HTK) and deep learning (with Theano) methods
☆79Aug 28, 2015Updated 10 years ago
rakshithShetty / dnn-speech
View on GitHub
This ist the repository for the term project Speech Recognition using Deep Neural Networks for the course ELEC-E5510-Speech Recognition
☆12Dec 8, 2015Updated 10 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
kdavis-mozilla / iris
View on GitHub
Demo WebApp using Kaldi DNN engine to convert speech to text
☆11Jun 12, 2016Updated 10 years ago
zxie / nn
View on GitHub
☆19May 16, 2015Updated 11 years ago
danieldimatteo / android-speech-diarization
View on GitHub
An Android app that listens to conversations and determines who was speaking at any point in the conversation - a task known as speech di…
☆14Apr 12, 2021Updated 5 years ago
jcsilva / deep-clustering
View on GitHub
☆70Feb 16, 2017Updated 9 years ago
sdrobert / pydrobert-kaldi
View on GitHub
SWIG bindings for Kaldi I/O, built with Conda
☆15Dec 15, 2024Updated last year
dspavankumar / keras-kaldi
View on GitHub
Keras Interface for Kaldi ASR
☆122Sep 27, 2017Updated 8 years ago
gerasimos / doc-rasa-on-m1
View on GitHub
Rasa on M1: installation guideline
☆14Jan 8, 2023Updated 3 years ago
irebai / SpecAugment_KALDI
View on GitHub
A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆15Sep 4, 2019Updated 6 years ago
ejhumphrey / dl4mir-dissertation
View on GitHub
Humphrey, E. J. "An Exploration of Deep Learning in Music Informatics." (2015) New York University.
☆14Feb 23, 2016Updated 10 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
ICASSP2021-tutorial9 / Distant_conversational_ASR_and_analysis
View on GitHub
☆12Jun 10, 2021Updated 5 years ago
danijel3 / SparrowhawkTest
View on GitHub
A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine
☆14Oct 16, 2017Updated 8 years ago
leichtrhino / ChimeraNet
View on GitHub
Unofficial implementation of music separation model by Luo et.al.
☆13Nov 3, 2019Updated 6 years ago
OSU-slatelab / mimic-enhance
View on GitHub
Speech enhancement using mimic loss
☆16Oct 25, 2019Updated 6 years ago
mozilla / murmur
View on GitHub
DEPRECATED - A webapp for collecting speech samples for voice recognition testing and training
☆20May 23, 2019Updated 7 years ago
ilarele / torch_examples
View on GitHub
[adversarial] examples and training cost
☆19Jun 29, 2016Updated 10 years ago
xanguera / BeamformIt
View on GitHub
BeamformIt acoustic beamforming software
☆384May 19, 2020Updated 6 years ago
UFAL-DSG / pykaldi
View on GitHub
Python wrapper for Kaldi decoders (Kaldi https://sourceforge.net/projects/kaldi/)
☆80Dec 13, 2015Updated 10 years ago
IoSR-Surrey / RealRoomBRIRs
View on GitHub
Binaural impulse responses captured in real rooms.
☆41Mar 9, 2016Updated 10 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
qqueing / speaker_embedding-pytorch
View on GitHub
"An Improved Deep Embedding Learning Method for Short Duration Speaker Verification" pytorch implementation
☆19Oct 8, 2018Updated 7 years ago
hlt-bme-hu / hunspeech
View on GitHub
☆14Jan 24, 2017Updated 9 years ago
bsxfan / meta-embeddings
View on GitHub
Meta-embeddings are a probabilistic generalization of embeddings in machine learning.
☆23Nov 23, 2018Updated 7 years ago
MattShannon / htk_io
View on GitHub
Read and write HTK and HTS files from python.
☆20Mar 17, 2015Updated 11 years ago
NervanaSystems / deepspeech
View on GitHub
DeepSpeech neon implementation
☆220Jan 3, 2023Updated 3 years ago
tommy-fox / streaming-source-separation
View on GitHub
Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.
☆21Dec 8, 2022Updated 3 years ago
SeanNaren / deepspeech.torch
View on GitHub
Speech Recognition using DeepSpeech2 network and the CTC activation function.
☆261Jun 8, 2017Updated 9 years ago
NibuTake / LiNGAM-fast
View on GitHub
☆12May 17, 2018Updated 8 years ago
rizar / attention-lvcsr
View on GitHub
End-to-End Attention-Based Large Vocabulary Speech Recognition
☆265Nov 22, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
amaas / stanford-ctc
View on GitHub
Neural net code for lexicon-free speech recognition with connectionist temporal classification
☆250Feb 23, 2016Updated 10 years ago
alexander-beer-weiss / multi-GPU_server
View on GitHub
a multi-threaded, multi-GPU Waffle web server
☆12Apr 12, 2016Updated 10 years ago
joaoantoniocn / AM-MobileNet1D
View on GitHub
The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…
☆31Oct 3, 2023Updated 2 years ago
pseeth / otoworld
View on GitHub
Applying reinforcement learning to perform source separation.
☆23Nov 25, 2020Updated 5 years ago
foamliu / Speaker-Embeddings
View on GitHub
PyTorch implementation of a self-attentive speaker embedding
☆17Sep 24, 2019Updated 6 years ago
rOpenGov / enigma
View on GitHub
R client for the Enigma.io API - ABANDONED
☆16Feb 15, 2018Updated 8 years ago
raingo / image-caption-baseline
View on GitHub
Clean and easy to extend baseline for image captioning in tensorflow
☆10Jul 12, 2016Updated 10 years ago