01-vyom / End_2_End_Automatic_Speech_Recognition_For_GujaratiLinks
[ICON 2020] TensorFlow Code for "End-to-End Automatic Speech Recognition System for Gujarati"
☆12Updated 3 years ago
Alternatives and similar repositories for End_2_End_Automatic_Speech_Recognition_For_Gujarati
Users that are interested in End_2_End_Automatic_Speech_Recognition_For_Gujarati are comparing it to the libraries listed below
Sorting:
- A speech recognition system based on a Convolutional Neural Network built using TensorFlow☆20Updated 4 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆39Updated last year
- This project is about performing Speaker diarization for Hindi Language.☆50Updated 4 years ago
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2☆86Updated last year
- ☆90Updated 2 years ago
- ☆43Updated 2 years ago
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR☆56Updated last month
- Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.☆17Updated 3 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆87Updated 2 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆214Updated last year
- ☆50Updated 2 years ago
- Vaksanca introduces free Sanskrit speech corpus with vowel segmentation.☆15Updated 3 years ago
- Speech to Text with self-supervised learning based on wav2vec 2.0 framework using Hugging Face's Transformer☆30Updated 4 years ago
- An implementation of Conformer: Convolution-augmented Transformer for Speech Recognition, a Transformer Variant in TensorFlow/Keras☆43Updated 3 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆168Updated 11 months ago
- ☆30Updated 2 years ago
- End-to-End Speech Recognition☆12Updated 4 years ago
- We'll look into audio categorization using deep learning principles like Artificial Neural Networks (ANN), 1D Convolutional Neural Networ…☆44Updated 3 years ago
- An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, …☆75Updated 4 years ago
- A speaker gender classifier. MFC feature engineering and a pre-trained ResNet-50. GradCAM interpretation.☆27Updated 3 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆90Updated 3 years ago
- Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem☆94Updated last week
- This repository consists of the IPython Notebook for the work related to audio processing and implementing convolution neural networks fo…☆13Updated 6 years ago
- Code and slides for the "Deep learning (audio) application: From design to deployment" tutorials.☆175Updated last year
- Speech to text transcription using RNN (Listen, Attend and Spell).☆11Updated 5 years ago
- Text to Speech for Indic languages☆51Updated 3 years ago
- Finetune Wa2vec 2.0 For Speech Recognition☆131Updated 4 months ago
- The aim of the project was to convert an image to speech. An image is processed and segmented to identify the text in the image. Then the…☆13Updated 6 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆65Updated 4 years ago
- Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech☆92Updated last year