01-vyom / End_2_End_Automatic_Speech_Recognition_For_Gujarati
[ICON 2020] TensorFlow Code for "End-to-End Automatic Speech Recognition System for Gujarati"
☆11Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for End_2_End_Automatic_Speech_Recognition_For_Gujarati
- This project is about performing Speaker diarization for Hindi Language.☆45Updated 3 years ago
- End-to-End Speech Recognition☆10Updated 3 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆212Updated 4 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆85Updated 2 years ago
- A speech recognition system based on a Convolutional Neural Network built using TensorFlow☆16Updated 3 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆64Updated 3 years ago
- Finetune Wa2vec 2.0 For Speech Recognition☆115Updated last year
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆206Updated last year
- ☆27Updated 2 years ago
- Time series course Fall 2019 project☆53Updated 4 years ago
- A unified dataset of multilingual emotional human utterances☆23Updated 2 years ago
- PyTorch implementation of Sequence Transduction with Recurrent Neural Networks (RNN-T) speech recognition paper☆11Updated 2 years ago
- This GitHub repository contains converted models in ONNX, TensorRT, and PyTorch formats, along with inference scripts and demos. These mo…☆13Updated last year
- Speech to Text with self-supervised learning based on wav2vec 2.0 framework using Hugging Face's Transformer☆30Updated 3 years ago
- ☆90Updated 2 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆89Updated 2 years ago
- ☆12Updated 8 months ago
- Identify the emotion of multiple speakers in an Audio Segment☆164Updated last year
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆160Updated 5 months ago
- We'll look into audio categorization using deep learning principles like Artificial Neural Networks (ANN), 1D Convolutional Neural Networ…☆33Updated 2 years ago
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://…☆11Updated 2 years ago
- Speaker Identification System (upto 100% accuracy); built using Python 2.7 and python_speech_features library☆207Updated 4 years ago
- Multi-class audio classification with MFCC features using CNN☆28Updated 4 years ago
- ☆41Updated last year
- Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation (Deep Learning for Human Language Processing Special Project)☆16Updated 4 years ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆58Updated 2 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆165Updated 4 months ago
- Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem☆86Updated last year
- In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keywor…☆92Updated last year
- speaker_diarization done on toy dataset and tested on timit dataset☆8Updated 2 years ago