Deep Learning for Speech Recogntion based on Theano
☆15Jul 28, 2017Updated 8 years ago
Alternatives and similar repositories for Deep-Speech
Users that are interested in Deep-Speech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Sep 16, 2014Updated 11 years ago
- Automatically exported from code.google.com/p/transducersaurus☆11Apr 1, 2015Updated 11 years ago
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3.☆12Aug 18, 2018Updated 7 years ago
- Lupa for Torch☆10Sep 16, 2015Updated 10 years ago
- tools around preparing TIMIT for HMM (with HTK) and deep learning (with Theano) methods☆78Aug 28, 2015Updated 10 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This ist the repository for the term project Speech Recognition using Deep Neural Networks for the course ELEC-E5510-Speech Recognition☆12Dec 8, 2015Updated 10 years ago
- Demo WebApp using Kaldi DNN engine to convert speech to text☆11Jun 12, 2016Updated 10 years ago
- A Fluent Java API for Cascading☆22Jun 14, 2017Updated 9 years ago
- ☆19May 16, 2015Updated 11 years ago
- An Android app that listens to conversations and determines who was speaking at any point in the conversation - a task known as speech di…☆14Apr 12, 2021Updated 5 years ago
- ☆70Feb 16, 2017Updated 9 years ago
- A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆15Sep 4, 2019Updated 6 years ago
- Rasa on M1: installation guideline☆14Jan 8, 2023Updated 3 years ago
- Keras Interface for Kaldi ASR☆122Sep 27, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Humphrey, E. J. "An Exploration of Deep Learning in Music Informatics." (2015) New York University.☆14Feb 23, 2016Updated 10 years ago
- A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine☆14Oct 16, 2017Updated 8 years ago
- Unofficial implementation of music separation model by Luo et.al.☆13Nov 3, 2019Updated 6 years ago
- ☆12Jun 10, 2021Updated 5 years ago
- DEPRECATED - A webapp for collecting speech samples for voice recognition testing and training☆20May 23, 2019Updated 7 years ago
- Speech enhancement using mimic loss☆16Oct 25, 2019Updated 6 years ago
- [adversarial] examples and training cost☆19Jun 29, 2016Updated 9 years ago
- Top level code to transcribe English audio/video files into text/subtitles☆21Jun 12, 2018Updated 8 years ago
- BeamformIt acoustic beamforming software☆384May 19, 2020Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Binaural impulse responses captured in real rooms.☆40Mar 9, 2016Updated 10 years ago
- "An Improved Deep Embedding Learning Method for Short Duration Speaker Verification" pytorch implementation☆19Oct 8, 2018Updated 7 years ago
- Python wrapper for Kaldi decoders (Kaldi https://sourceforge.net/projects/kaldi/)☆80Dec 13, 2015Updated 10 years ago
- Read and write HTK and HTS files from python.☆20Mar 17, 2015Updated 11 years ago
- DeepSpeech neon implementation☆221Jan 3, 2023Updated 3 years ago
- Parallel Optimization of Motion Estimation (ME) module based on CUDA☆16Mar 25, 2016Updated 10 years ago
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆21Dec 8, 2022Updated 3 years ago
- tensorflow and bazel for aarch64: binaries at...☆15Jan 30, 2018Updated 8 years ago
- blog: https://1planet.co.jp/tech-blog/applevisionpro-oneplanet-mac-spatialvideo☆10May 1, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Speech Recognition using DeepSpeech2 network and the CTC activation function.☆261Jun 8, 2017Updated 9 years ago
- End-to-End Attention-Based Large Vocabulary Speech Recognition☆265Nov 22, 2022Updated 3 years ago
- ☆12May 17, 2018Updated 8 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Nov 23, 2018Updated 7 years ago
- Neural net code for lexicon-free speech recognition with connectionist temporal classification☆250Feb 23, 2016Updated 10 years ago
- random pytorch hacks☆26Jul 26, 2017Updated 8 years ago
- a multi-threaded, multi-GPU Waffle web server☆12Apr 12, 2016Updated 10 years ago