saiful9379 / BanglaASRLinks
Fine-tune Bangla ASR model which was trained Bangla Mozilla Common Voice Dataset
β12Updated last year
Alternatives and similar repositories for BanglaASR
Users that are interested in BanglaASR are comparing it to the libraries listed below
Sorting:
- Transformer based Bangla Speech Recognition | Encoder Decoder Architectureβ55Updated 2 years ago
- π Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.β36Updated 9 months ago
- A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNsβ23Updated last year
- Identify the emotion of multiple speakers in an Audio Segmentβ178Updated 2 years ago
- β60Updated 5 months ago
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://β¦β13Updated 3 years ago
- Aiming to achieve ultimate Multilingual TTS pipeline with main focus on releasing COQUIπΈTTS(Text-to-Speech) based high performing neuralβ¦β42Updated 2 years ago
- We'll look into audio categorization using deep learning principles like Artificial Neural Networks (ANN), 1D Convolutional Neural Networβ¦β51Updated 3 years ago
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2β127Updated 2 years ago
- β183Updated last year
- ποΈ Arabic TTS models (Tacotron2, FastPitch)β130Updated 2 weeks ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.β353Updated 2 years ago
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASRβ68Updated 6 months ago
- This project is about performing Speaker diarization for Hindi Language.β58Updated 4 years ago
- Bangla TTS Inference pipeline using Vit TTSβ11Updated last year
- Bangla Unicode Normalizationβ21Updated last year
- β32Updated 3 years ago
- Fine tuned llama 3 models for context based question answering in bengali language.β18Updated last year
- Finetune VITS and MMS using HuggingFace's toolsβ182Updated last year
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023β55Updated 2 years ago
- A live speech recognition using Facebooks wav2vec 2.0 model.β374Updated last year
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.β87Updated 3 years ago
- An end-to-end system which makes use of a recurrent encoder-decoder deep neural network to translate speech from the Hindi (Fourth most sβ¦β18Updated 6 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLMβ37Updated 2 years ago
- β49Updated 2 years ago
- Finetune Wa2vec 2.0 For Speech Recognitionβ143Updated 10 months ago
- Wav2Vec for speech recognition, classification, and audio classificationβ269Updated 3 years ago
- Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGanβ62Updated 2 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)β218Updated 2 years ago
- Developed and trained Gated-CNN models to detect types of stutter in speech and SVM classifier to suggest new therapies to the user accorβ¦β19Updated 4 years ago