ashwin9999 / speech-recognition-CNNLinks
A speech recognition system based on a Convolutional Neural Network built using TensorFlow
☆21Updated 4 years ago
Alternatives and similar repositories for speech-recognition-CNN
Users that are interested in speech-recognition-CNN are comparing it to the libraries listed below
Sorting:
- We'll look into audio categorization using deep learning principles like Artificial Neural Networks (ANN), 1D Convolutional Neural Networ…☆51Updated 3 years ago
- End-to-End Speech Recognition using Neural Networks.☆35Updated last year
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆212Updated 5 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆170Updated last year
- In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keywor…☆105Updated 2 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆218Updated 2 years ago
- An implementation of MatchboxNet☆12Updated 3 years ago
- Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTroch with CTC loss and beam search.☆16Updated 5 years ago
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id…☆68Updated 4 years ago
- ☆117Updated 5 years ago
- A neural attention model for speech command recognition☆187Updated 4 months ago
- Speaker Identification System (upto 100% accuracy); built using Python 2.7 and python_speech_features library☆212Updated 5 years ago
- This project is about performing Speaker diarization for Hindi Language.☆52Updated 4 years ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).☆89Updated last year
- Tensorflow2 based implementation of ContextNet, an improved convolutional rnn-transducer-based architecture for end-to-end speech recogni…☆18Updated 5 years ago
- Finetune Wa2vec 2.0 For Speech Recognition☆142Updated 9 months ago
- An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, …☆78Updated 5 years ago
- Kaldi based speaker verification☆47Updated 7 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆66Updated 4 years ago
- Code and slides for the "Deep learning (audio) application: From design to deployment" tutorials.☆175Updated last year
- The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)☆37Updated 2 years ago
- Speaker identification using voice MFCCs and GMM☆54Updated 4 years ago
- Wav2kws is keyword spotting (KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Google Speech Commands datasets V1 and V2.☆13Updated 4 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?☆33Updated 7 years ago
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]☆27Updated 4 years ago
- Using Convolutional Neural Networks in speech emotion recognition on the RAVDESS Audio Dataset.☆141Updated 4 years ago
- Learning Efficient Representations for Keyword Spotting with Triplet Loss☆112Updated 3 years ago
- Speech Emotion Recognition☆43Updated 2 years ago
- Voice Activity Detection (VAD) using deep learning.☆201Updated 6 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆179Updated 11 months ago