01-vyom / End_2_End_Automatic_Speech_Recognition_For_Gujarati
[ICON 2020] TensorFlow Code for "End-to-End Automatic Speech Recognition System for Gujarati"
☆11 Updated 3 years ago
Related projects
Alternatives and complementary repositories for End_2_End_Automatic_Speech_Recognition_For_Gujarati
- A speech recognition system based on a Convolutional Neural Network built using TensorFlow ☆16 Updated 3 years ago
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2 ☆82 Updated 7 months ago
- End-to-End Speech Recognition ☆10 Updated 3 years ago
- We'll look into audio categorization using deep learning principles like Artificial Neural Networks (ANN), 1D Convolutional Neural Networ… ☆32 Updated 2 years ago
- Multi-class audio classification with MFCC features using CNN ☆28 Updated 4 years ago
- Finetune Wav2vec 2.0 For Speech Recognition ☆111 Updated last year
- This project performs speaker diarization for the Hindi language. ☆45 Updated 3 years ago
- Voice-based gender recognition using Mel-frequency cepstral coefficients (MFCC) and Gaussian mixture models (GMM) ☆202 Updated last year
- Repository containing an experimentation platform for training and running inference on wav2vec2 models. ☆85 Updated 2 years ago
- A speaker gender classifier. MFC feature engineering and a pre-trained ResNet-50. GradCAM interpretation. ☆26 Updated 2 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using PyTorch ☆210 Updated 4 years ago
- Code and slides for the "Deep learning (audio) application: From design to deployment" tutorials. ☆168 Updated 8 months ago
- An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, … ☆73 Updated 4 years ago
- PyTorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection ☆85 Updated last year
- Speech to Text with self-supervised learning based on the wav2vec 2.0 framework using Hugging Face Transformers ☆30 Updated 3 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks ☆156 Updated 4 months ago
- Voice Activity Detection (VAD) using deep learning. ☆191 Updated 5 years ago
- Developed and trained Gated-CNN models to detect types of stutter in speech and SVM classifier to suggest new therapies to the user accor… ☆18 Updated 3 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2 ☆88 Updated 2 years ago
- Open source speech to text models for Indic Languages ☆287 Updated 2 years ago
- Voice Activity Detection based on Deep Learning & TensorFlow ☆354 Updated last year
- Speaker diarization done on a toy dataset and tested on the TIMIT dataset ☆8 Updated 2 years ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from Hugging Face. ☆249 Updated last year
- Text-to-Speech for languages of India ☆151 Updated this week