ashwin9999 / speech-recognition-CNN
A speech recognition system based on a Convolutional Neural Network built using TensorFlow
☆21Updated 5 years ago
Alternatives and similar repositories for speech-recognition-CNN
Users who are interested in speech-recognition-CNN are comparing it to the libraries listed below
- We'll look into audio categorization using deep learning principles like Artificial Neural Networks (ANN), 1D Convolutional Neural Networ…☆53Updated 3 years ago
- Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTorch with CTC loss and beam search.☆16Updated 5 years ago
- Code and slides for the "Deep learning (audio) application: From design to deployment" tutorials.☆180Updated last year
- ☆117Updated 5 years ago
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id…☆69Updated 5 years ago
- A neural attention model for speech command recognition☆186Updated 7 months ago
- An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, …☆79Updated 5 years ago
- Using Convolutional Neural Networks in speech emotion recognition on the RAVDESS Audio Dataset.☆144Updated 4 years ago
- Fine-tune a WhisperAI model for your language☆21Updated 2 years ago
- QuartzNet implementation in PyTorch [https://arxiv.org/abs/1910.10261]☆26Updated 4 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆220Updated 2 years ago
- TensorFlow 2-based implementation of ContextNet, an improved convolutional RNN-transducer-based architecture for end-to-end speech recogni…☆18Updated 5 years ago
- Wav2Vec for speech recognition, classification, and audio classification☆274Updated 3 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆184Updated last year
- Official implementation of the Keyword Transformer: https://arxiv.org/abs/2104.00769☆136Updated 3 years ago
- An implementation of Conformer: Convolution-augmented Transformer for Speech Recognition, a Transformer Variant in TensorFlow/Keras☆45Updated 4 years ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).☆90Updated last year
- ☆90Updated 3 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆170Updated last year
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆212Updated 5 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆68Updated 4 years ago
- Voice Activity Detection (VAD) using deep learning.☆204Updated 6 years ago
- Speaker Identification System (up to 100% accuracy); built using Python 2.7 and the python_speech_features library☆212Updated 5 years ago
- ☆49Updated 2 years ago
- Various audio processing tasks☆21Updated 3 years ago
- In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keywor…☆107Updated 3 years ago
- Repository containing an experimentation platform for training and running inference on wav2vec2 models.☆87Updated 3 years ago
- The neural network model is capable of detecting five different male/female emotions from speech audio. (Deep Learning, NLP, Python)☆38Updated 3 years ago
- Voice Biometrics Authentication using GMM and Face Recognition using FaceNet and TensorFlow☆113Updated 5 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆91Updated 4 years ago