apoorvnandan / speech-recognition-primerLinks
This repository contains code for a tutorial on end to end automatic speech recognition.
☆17Updated 5 years ago
Alternatives and similar repositories for speech-recognition-primer
Users that are interested in speech-recognition-primer are comparing it to the libraries listed below
Sorting:
- End-to-End Speech Recognition using Neural Networks.☆35Updated 11 months ago
- It uses GMM to train a gender detector model. The testing has been done on subset of Google's AudioSet corpus.☆19Updated 8 years ago
- It uses GMM to train a speaker identification model. The training and testing has been done on subset (34 speakers) from VoxForge data co…☆57Updated 5 years ago
- A neural attention model for speech command recognition☆185Updated 3 weeks ago
- This project is about performing Speaker diarization for Hindi Language.☆50Updated 4 years ago
- Paper: https://arxiv.org/abs/1702.02285☆64Updated 6 years ago
- Identifying people from small audio fragments☆170Updated 5 years ago
- End to End Dialect Identification using Convolutional Neural Network☆52Updated 5 years ago
- CNN 1D vs 2D audio classification☆104Updated 6 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆211Updated 5 years ago
- A Simple Automatic Speech Recognition (ASR) Model in Tensorflow, which only needs to focus on Deep Neural Network. It's easy to test popu…☆19Updated 7 years ago
- 💬 A list of End-to-End speech recognition, including papers, codes and other materials☆52Updated 6 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?☆33Updated 7 years ago
- Improving the Goodness of Pronunciation with DNNs and RNNs☆32Updated 6 years ago
- https://www.kaggle.com/c/tensorflow-speech-recognition-challenge/☆21Updated 7 years ago
- Speaker diarization python system based on binary key speaker modelling☆60Updated 3 years ago
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆74Updated 4 years ago
- Augmented Audio Data Generator for 1D-Convolutional Neural Networks☆48Updated 3 years ago
- ⇨ The Speaker Recognition System consists of two phases, Feature Extraction and Recognition. ⇨ In the Extraction phase, the Speaker's vo…☆38Updated 5 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆66Updated 4 years ago
- This repository provides you the details of how speech recognition is done from end to end.☆25Updated 6 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆38Updated 7 years ago
- ☆38Updated 5 years ago
- Speaker diarization with GMM-UBM and MAP Adaptation☆30Updated 6 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Updated 5 years ago
- Working online speech recognition based on RNN Transducer. ( Trained model release available in release )☆293Updated 4 years ago
- End-to-end spoken language identification out of the box.☆48Updated 4 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆176Updated 8 months ago
- transformer for ASR-systerm (via tensorflow2.0)☆114Updated 6 years ago
- Code for the paper Language Identification Using Deep Convolutional Recurrent Neural Networks☆105Updated 7 years ago