saiful9379 / BanglaASR
Fine-tuned Bangla ASR model trained on the Bangla Mozilla Common Voice dataset
☆12 Updated last year
Alternatives and similar repositories for BanglaASR
Users that are interested in BanglaASR are comparing it to the libraries listed below
- Transformer based Bangla Speech Recognition | Encoder Decoder Architecture ☆54 Updated 2 years ago
- Aiming to achieve ultimate Multilingual TTS pipeline with main focus on releasing COQUI 🐸TTS (Text-to-Speech) based high performing neural… ☆42 Updated 2 years ago
- Bangla TTS Inference pipeline using Vit TTS ☆11 Updated last year
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR ☆65 Updated 5 months ago
- Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod. ☆35 Updated 9 months ago
- Bangla Unicode Normalization ☆21 Updated last year
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023 ☆55 Updated 2 years ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription) ☆103 Updated 3 months ago
- This project is about performing Speaker diarization for Hindi Language. ☆55 Updated 4 years ago
- Arabic TTS models (Tacotron2, FastPitch) ☆128 Updated last year
- A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNs ☆23 Updated last year
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2 ☆126 Updated 2 years ago
- ☆177 Updated 11 months ago
- ☆60 Updated 4 months ago
- Fine tuned llama 3 models for context based question answering in bengali language. ☆18 Updated last year
- Identify the emotion of multiple speakers in an Audio Segment ☆178 Updated 2 years ago
- This repository contains the training codes of the fine-tuned SpeechT5 on a Turkish dataset. ☆21 Updated last year
- Deep Learning - one shot learning for speaker recognition using Filter Banks ☆170 Updated last year
- Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech ☆92 Updated 2 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM ☆38 Updated 2 years ago
- Text-to-Speech for languages of India ☆298 Updated last year
- Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan ☆61 Updated last year
- NPTEL2020: Speech2Text dataset for Indian-English Accent ☆77 Updated 3 years ago
- Finetune VITS and MMS using HuggingFace's tools ☆177 Updated last year
- A speech recognition system based on a Convolutional Neural Network built using TensorFlow ☆21 Updated 4 years ago
- Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data. ☆17 Updated 4 years ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface. ☆352 Updated 2 years ago
- [Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages ☆14 Updated last year
- We'll look into audio categorization using deep learning principles like Artificial Neural Networks (ANN), 1D Convolutional Neural Networ… ☆51 Updated 3 years ago
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2 ☆99 Updated 3 months ago