01-vyom / End_2_End_Automatic_Speech_Recognition_For_Gujarati
[ICON 2020] TensorFlow Code for "End-to-End Automatic Speech Recognition System for Gujarati"
☆12Updated 4 years ago
Alternatives and similar repositories for End_2_End_Automatic_Speech_Recognition_For_Gujarati
Users who are interested in End_2_End_Automatic_Speech_Recognition_For_Gujarati are comparing it to the repositories listed below
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from Hugging Face.☆356Updated 2 years ago
- ☆117Updated 5 years ago
- We'll look into audio categorization using deep learning principles like Artificial Neural Networks (ANN), 1D Convolutional Neural Networ…☆51Updated 3 years ago
- This GitHub repository contains converted models in ONNX, TensorRT, and PyTorch formats, along with inference scripts and demos. These mo…☆14Updated 2 years ago
- This project performs speaker diarization for the Hindi language.☆58Updated 4 years ago
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2☆101Updated 4 months ago
- End-to-End Speech Recognition☆12Updated 4 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆170Updated last year
- ☆49Updated 2 years ago
- Live speech recognition using Facebook's wav2vec 2.0 model.☆375Updated last year
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆218Updated 2 years ago
- Identifying people from small audio fragments☆170Updated 5 years ago
- Repository containing an experimentation platform for training and running inference on wav2vec2 models.☆87Updated 3 years ago
- Open source speech to text models for Indic Languages☆313Updated 3 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆37Updated 2 years ago
- Speaker Identification System (up to 100% accuracy); built using Python 2.7 and the python_speech_features library☆212Updated 5 years ago
- Speech to text with self-supervised learning based on the wav2vec 2.0 framework☆381Updated 4 years ago
- Voice Biometrics Authentication using GMM and Face Recognition Using Facenet and Tensorflow☆113Updated 5 years ago
- Multi-class audio classification with MFCC features using CNN☆31Updated 5 years ago
- Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.☆18Updated 4 years ago
- PyTorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection☆97Updated 7 months ago
- Wav2Vec for speech recognition, classification, and audio classification☆269Updated 3 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using PyTorch☆212Updated 5 years ago
- A speech recognition system based on a Convolutional Neural Network built using TensorFlow☆21Updated 5 years ago
- Identify the emotion of multiple speakers in an Audio Segment☆178Updated 2 years ago
- Speech to Text with self-supervised learning based on the wav2vec 2.0 framework using Hugging Face's Transformers☆29Updated 4 years ago
- ☆90Updated 3 years ago
- Implementation of the paper "Speech emotion recognition with deep convolutional neural networks" by Dias Issa et al.☆13Updated 4 years ago
- Code and slides for the "Deep learning (audio) application: From design to deployment" tutorials.☆176Updated last year
- Voice Activity Detection based on Deep Learning & TensorFlow☆370Updated 2 years ago