01-vyom / End_2_End_Automatic_Speech_Recognition_For_Gujarati
[ICON 2020] TensorFlow Code for "End-to-End Automatic Speech Recognition System for Gujarati"
☆12Updated 3 years ago
Alternatives and similar repositories for End_2_End_Automatic_Speech_Recognition_For_Gujarati
Users that are interested in End_2_End_Automatic_Speech_Recognition_For_Gujarati are comparing it to the libraries listed below
Sorting:
- End-to-End Speech Recognition☆12Updated 4 years ago
- A speech recognition system based on a Convolutional Neural Network built using TensorFlow☆19Updated 4 years ago
- This project is about performing Speaker diarization for Hindi Language.☆49Updated 4 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆168Updated 10 months ago
- An implementation of Conformer: Convolution-augmented Transformer for Speech Recognition, a Transformer Variant in TensorFlow/Keras☆43Updated 3 years ago
- Code and slides for the "Deep learning (audio) application: From design to deployment" tutorials.☆174Updated last year
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆87Updated 2 years ago
- A speaker gender classifier. MFC feature engineering and a pre-trained ResNet-50. GradCAM interpretation.☆27Updated 3 years ago
- We'll look into audio categorization using deep learning principles like Artificial Neural Networks (ANN), 1D Convolutional Neural Networ…☆44Updated 3 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆211Updated 4 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆213Updated last year
- ☆90Updated 2 years ago
- Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem☆93Updated last year
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆39Updated last year
- An attempt to Vietnamese speech enhencement with U-net and Unet based ResNet☆22Updated 3 years ago
- ☆45Updated 7 years ago
- ☆43Updated 2 years ago
- Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.☆17Updated 3 years ago
- Fine-tune Bangla ASR model which was trained Bangla Mozilla Common Voice Dataset☆10Updated last year
- Finetune Wa2vec 2.0 For Speech Recognition☆131Updated 3 months ago
- Feature extraction from sound signals along with complete CNN model and evaluations using tensorflow, keras and, librosa for MFCC generat…☆10Updated 3 years ago
- This repository provides you the details of how speech recognition is done from end to end.☆25Updated 6 years ago
- Multi-class audio classification with MFCC features using CNN☆30Updated 5 years ago
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://…☆13Updated 3 years ago
- ☆118Updated 4 years ago
- Speaker Identification System (upto 100% accuracy); built using Python 2.7 and python_speech_features library☆208Updated 4 years ago
- Time series course Fall 2019 project☆54Updated 4 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆65Updated 4 years ago
- Voice Activity Detection based on Deep Learning & TensorFlow☆363Updated 2 years ago
- A python model to detect and segment coughs, forked from coughvid's repo☆10Updated 6 months ago