saiful9379 / BanglaASR
Fine-tuned Bangla ASR model trained on the Bangla Mozilla Common Voice dataset
☆11Updated last year
Alternatives and similar repositories for BanglaASR
Users that are interested in BanglaASR are comparing it to the libraries listed below
- Transformer based Bangla Speech Recognition☆53Updated 2 years ago
- ☆157Updated 7 months ago
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR☆59Updated last month
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2☆126Updated 2 years ago
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023☆54Updated 2 years ago
- Bangla TTS Inference pipeline using Vit TTS☆8Updated last year
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆319Updated 2 years ago
- Speech synthesis (TTS) in low-resource languages by training from scratch with FastPitch and fine-tuning with HiFi-GAN☆57Updated last year
- Identify the emotion of multiple speakers in an Audio Segment☆172Updated 2 years ago
- TTS models for Arabic (Tacotron2, FastPitch)☆119Updated 8 months ago
- Bangla Unicode Normalization☆20Updated last year
- Speech Emotion Recognition (SER) in real-time, using Deep Neural Networks (DNN) of Long Short-Term Memory (LSTM).☆111Updated 3 years ago
- Fine-tuned Llama 3 models for context-based question answering in the Bengali language.☆12Updated 9 months ago
- Text-to-Speech for languages of India☆258Updated 8 months ago
- This project performs speaker diarization for the Hindi language.☆50Updated 4 years ago
- Aiming to achieve ultimate Multilingual TTS pipeline with main focus on releasing COQUI🐸TTS(Text-to-Speech) based high performing neural…☆41Updated last year
- Finetune VITS and MMS using HuggingFace's tools☆159Updated last year
- End-to-End Speech Recognition☆12Updated 4 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Updated last year
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://…☆13Updated 3 years ago
- An end-to-end system which makes use of a recurrent encoder-decoder deep neural network to translate speech from the Hindi (Fourth most s…☆18Updated 6 years ago
- Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving qu…☆35Updated 3 weeks ago
- Fine-tune Wav2vec 2.0 for Speech Recognition☆141Updated 5 months ago
- ☆49Updated 2 years ago
- ☆32Updated 2 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆168Updated last year
- The aim of the project was to convert an image to speech. An image is processed and segmented to identify the text in the image. Then the…☆13Updated 6 years ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).☆86Updated last year
- A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNs☆22Updated 11 months ago
- We'll look into audio categorization using deep learning principles like Artificial Neural Networks (ANN), 1D Convolutional Neural Networ…☆47Updated 3 years ago