Deep Learning for Speech Recogntion based on Theano
☆15Jul 28, 2017Updated 8 years ago
Alternatives and similar repositories for Deep-Speech
Users that are interested in Deep-Speech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Sep 16, 2014Updated 11 years ago
- Automatically exported from code.google.com/p/transducersaurus☆11Apr 1, 2015Updated 10 years ago
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3.☆12Aug 18, 2018Updated 7 years ago
- tools around preparing TIMIT for HMM (with HTK) and deep learning (with Theano) methods☆78Aug 28, 2015Updated 10 years ago
- This ist the repository for the term project Speech Recognition using Deep Neural Networks for the course ELEC-E5510-Speech Recognition☆12Dec 8, 2015Updated 10 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Demo WebApp using Kaldi DNN engine to convert speech to text☆11Jun 12, 2016Updated 9 years ago
- A Fluent Java API for Cascading☆22Jun 14, 2017Updated 8 years ago
- ☆70Feb 16, 2017Updated 9 years ago
- SWIG bindings for Kaldi I/O, built with Conda☆15Dec 15, 2024Updated last year
- A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆15Sep 4, 2019Updated 6 years ago
- Rasa on M1: installation guideline☆14Jan 8, 2023Updated 3 years ago
- Keras Interface for Kaldi ASR☆122Sep 27, 2017Updated 8 years ago
- Humphrey, E. J. "An Exploration of Deep Learning in Music Informatics." (2015) New York University.☆14Feb 23, 2016Updated 10 years ago
- A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine☆14Oct 16, 2017Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Unofficial implementation of music separation model by Luo et.al.☆13Nov 3, 2019Updated 6 years ago
- ☆12Jun 10, 2021Updated 4 years ago
- DEPRECATED - A webapp for collecting speech samples for voice recognition testing and training☆20May 23, 2019Updated 6 years ago
- Speech enhancement using mimic loss☆16Oct 25, 2019Updated 6 years ago
- [adversarial] examples and training cost☆19Jun 29, 2016Updated 9 years ago
- BeamformIt acoustic beamforming software☆383May 19, 2020Updated 5 years ago
- Top level code to transcribe English audio/video files into text/subtitles☆21Jun 12, 2018Updated 7 years ago
- Binaural impulse responses captured in real rooms.☆37Mar 9, 2016Updated 10 years ago
- ☆15Jan 24, 2017Updated 9 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- "An Improved Deep Embedding Learning Method for Short Duration Speaker Verification" pytorch implementation☆19Oct 8, 2018Updated 7 years ago
- Python wrapper for Kaldi decoders (Kaldi https://sourceforge.net/projects/kaldi/)☆80Dec 13, 2015Updated 10 years ago
- Read and write HTK and HTS files from python.☆20Mar 17, 2015Updated 11 years ago
- DeepSpeech neon implementation☆221Jan 3, 2023Updated 3 years ago
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆21Dec 8, 2022Updated 3 years ago
- Speech Recognition using DeepSpeech2 network and the CTC activation function.☆261Jun 8, 2017Updated 8 years ago
- End-to-End Attention-Based Large Vocabulary Speech Recognition☆265Nov 22, 2022Updated 3 years ago
- ☆12May 17, 2018Updated 7 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Nov 23, 2018Updated 7 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Neural net code for lexicon-free speech recognition with connectionist temporal classification☆250Feb 23, 2016Updated 10 years ago
- random pytorch hacks☆26Jul 26, 2017Updated 8 years ago
- a multi-threaded, multi-GPU Waffle web server☆12Apr 12, 2016Updated 9 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Oct 3, 2023Updated 2 years ago
- Applying reinforcement learning to perform source separation.☆23Nov 25, 2020Updated 5 years ago
- PyTorch implementation of a self-attentive speaker embedding☆17Sep 24, 2019Updated 6 years ago
- Probabilistic Linear Discriminant Analysis☆14Nov 14, 2014Updated 11 years ago