anjandeepsahni / automatic_speech_recognitionLinks
Speech to text transcription using RNN (Listen, Attend and Spell).
☆11Updated 6 years ago
Alternatives and similar repositories for automatic_speech_recognition
Users that are interested in automatic_speech_recognition are comparing it to the libraries listed below
Sorting:
- ☆90Updated 3 years ago
- Code and slides for the "Deep learning (audio) application: From design to deployment" tutorials.☆178Updated last year
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆87Updated 3 years ago
- Using Convolutional Neural Networks in speech emotion recognition on the RAVDESS Audio Dataset.☆144Updated 4 years ago
- A neural attention model for speech command recognition☆186Updated 6 months ago
- Udacity 2018 Machine Learning Nanodegree Capstone project☆147Updated 7 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆212Updated 5 years ago
- Understanding emotions from audio files using neural networks and multiple datasets.☆426Updated 2 years ago
- End-to-End Speech Recognition Using Tensorflow☆41Updated 2 years ago
- Several studies have been carried out to analyse Parkinson’s disease using speech impairments. Various tools and techniques have been use…☆12Updated 6 years ago
- Time series course Fall 2019 project☆53Updated 5 years ago
- End-to-End Speech Recognition☆12Updated 4 years ago
- Detecting emotions using MFCC features of human speech using Deep Learning☆132Updated 5 years ago
- ☆11Updated 4 years ago
- PyTorch implementation of Sequence Transduction with Recurrent Neural Networks (RNN-T) speech recognition paper☆15Updated 3 years ago
- An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, …☆79Updated 5 years ago
- A collection of Audio and Speech pre-trained models.☆193Updated 5 years ago
- TensorFlow implementation of "Multimodal Speech Emotion Recognition using Audio and Text," IEEE SLT-18☆298Updated last year
- Collection of research papers on cough classification☆40Updated 5 years ago
- ☆49Updated 2 years ago
- A list of publically available audio data that anyone can download for ASR or other speech activities☆231Updated 4 years ago
- 🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).☆386Updated 3 years ago
- Speaker Identification System (upto 100% accuracy); built using Python 2.7 and python_speech_features library☆212Updated 5 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆91Updated 4 years ago
- Feature extraction of speech signal is the initial stage of any speech recognition system.☆97Updated 5 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 3 years ago
- speech to text with self-supervised learning based on wav2vec 2.0 framework☆381Updated 4 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆220Updated 2 years ago
- 🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)☆223Updated 5 years ago
- Code for the "PyTorch for Audio + Music Processing" series on The Sound of AI YouTube channel.☆271Updated this week