A Simple Automatic Speech Recognition (ASR) Model in Tensorflow, which only needs to focus on Deep Neural Network. It's easy to test popular cells (most are LSTM and its variants) and models (unidirectioanl RNN, bidirectional RNN, ResNet and so on). Moreover, you are welcome to play with self-defined cells or models.
☆19Jan 18, 2018Updated 8 years ago
Alternatives and similar repositories for Automatic_Speech_Recognition_with_Multi_Models
Users that are interested in Automatic_Speech_Recognition_with_Multi_Models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the paper: Deep Residual Networks with Auditory Inspired Features for Robust Speech Recognition.☆21Mar 22, 2017Updated 9 years ago
- This is application for dysarthria to improve their pronunciation by using deep learning☆10Dec 29, 2020Updated 5 years ago
- 다양한 feature와 deep learning을 이용한 Phoneme Recognition입니다.☆13Nov 27, 2019Updated 6 years ago
- Raw waveform adaptation with SincNet☆12Mar 19, 2024Updated 2 years ago
- A query by humming system based on locality sensitive hashing indexes☆12May 8, 2014Updated 12 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A PyTorch implementation of speech recognition based on DeepMind's WaveNet☆18Jun 5, 2018Updated 8 years ago
- This ist the repository for the term project Speech Recognition using Deep Neural Networks for the course ELEC-E5510-Speech Recognition☆12Dec 8, 2015Updated 10 years ago
- readers that enable reading kaldi ark in tensorflow☆17Mar 7, 2018Updated 8 years ago
- Repo for the Insults Detection challenge on Kaggle.com☆11Mar 17, 2013Updated 13 years ago
- Simple LSTM language modelling toolkit☆10Oct 21, 2022Updated 3 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆47Jun 24, 2020Updated 6 years ago
- ASR for dysarthric speakers with Kaldi☆13Jan 14, 2017Updated 9 years ago
- Only in native python & numpy☆11Apr 7, 2018Updated 8 years ago
- A TensorFlow implementation for Chinese speech recognition based on DeepMind's WaveNet☆15Mar 27, 2018Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Asymmetric Convolutional Bidirectional LSTM Networks for Text Classification☆11Mar 26, 2018Updated 8 years ago
- A set of examples for basic audio data handling☆13Aug 15, 2020Updated 5 years ago
- build a pytorch framework for sentiment analysis (SemEval2016)☆11Dec 20, 2017Updated 8 years ago
- Fortran Library, Application, and Toolkit Packages☆16Dec 29, 2016Updated 9 years ago
- 通过alfred workflow用欧路词典查词☆14Sep 4, 2018Updated 7 years ago
- This is a working example of using CTC for phone recognition on TIMIT☆50Oct 19, 2017Updated 8 years ago
- Correspondence and autoencoder neural network training for speech using Pylearn2.☆14Dec 9, 2015Updated 10 years ago
- Word sense disambiguation using Bidirectional LSTM☆10Dec 23, 2019Updated 6 years ago
- Recognizing common speech commands using Keras and Tensorflow.☆10Dec 17, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Data preparation code for building Kaldi ASR system☆14Mar 18, 2017Updated 9 years ago
- ☆16May 15, 2019Updated 7 years ago
- Pytorch-Kaldi implementation of SNN-based ASR systems☆18Feb 1, 2020Updated 6 years ago
- Indoor Tracking is an application for Android phones, that tracks your walking in indoor environment.☆31Jan 16, 2016Updated 10 years ago
- An attempt to replicate the results of [1706.08612] VoxCeleb: a large-scale speaker identification dataset☆12Dec 11, 2019Updated 6 years ago
- items browsed in a session as a context are modeled to vec with bidirectional lstm☆18Nov 17, 2016Updated 9 years ago
- 2017 SURF of EEE Department XJTLU: Indoor Localization☆20Jan 15, 2020Updated 6 years ago
- Speech Recognition for speakers with speech disorders due to diseases like Cerebral Palsy, Parkinson or Amyotrophic Lateral Sclerosis ALS…☆23Mar 26, 2017Updated 9 years ago
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech☆17Sep 22, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A rough and ready Python utility which splits audio files based on silence and desired min/max chunk duration.☆16Jun 22, 2022Updated 4 years ago
- Source code for "Towards a Deeper Understanding of Adversarial Losses under a Discriminative Adversarial Network Setting"☆42Sep 1, 2022Updated 3 years ago
- Build an attention-based model for speech recogntion.Use the Word2vec model to help to train the attention model.☆30Dec 18, 2019Updated 6 years ago
- Baseline kaldi script for UA-SPEECH corpus☆32Oct 16, 2024Updated last year
- Python wrapper generator for Fortran☆31Jun 3, 2026Updated 3 weeks ago
- This repository contains code for a tutorial on end to end automatic speech recognition.☆18Sep 10, 2019Updated 6 years ago
- Generate SVG images using Python with traditional Japanese colors☆22Oct 22, 2017Updated 8 years ago