saiful9379 / BanglaASRLinks
Fine-tune Bangla ASR model which was trained Bangla Mozilla Common Voice Dataset
☆11Updated last year
Alternatives and similar repositories for BanglaASR
Users that are interested in BanglaASR are comparing it to the libraries listed below
Sorting:
- Transformer based Bangla Speech Recognition | Encoder Decoder Architecture☆54Updated 2 years ago
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR☆62Updated 3 months ago
- Fine tuned llama 3 models for context based question answering in bengali language.☆15Updated 11 months ago
- This project is about performing Speaker diarization for Hindi Language.☆51Updated 4 years ago
- Aiming to achieve ultimate Multilingual TTS pipeline with main focus on releasing COQUI🐸TTS(Text-to-Speech) based high performing neural…☆43Updated 2 years ago
- Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan☆58Updated last year
- Bangla Unicode Normalization☆20Updated last year
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2☆127Updated 2 years ago
- ☆49Updated 2 years ago
- 🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.☆31Updated 6 months ago
- Text Normalizer module use for Bangla as well as English digit convert to textual format, Normalize Date and Extract Date☆12Updated last month
- The aim of the project was to convert an image to speech. An image is processed and segmented to identify the text in the image. Then the…☆13Updated 7 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Updated last year
- Text-to-Speech for languages of India☆275Updated 10 months ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆169Updated last year
- Bangla text to speech, Multilingual (Bangla, English) real-time speech synthesis library☆89Updated 11 months ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆335Updated 2 years ago
- ☆54Updated 2 months ago
- ☆168Updated 9 months ago
- Finetune Wa2vec 2.0 For Speech Recognition☆138Updated 7 months ago
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023☆54Updated 2 years ago
- jupyter notebooks to fine tune whisper models on Vietnamese using Colab and/or Kaggle and/or AWS EC2☆18Updated last month
- End-to-End Speech Recognition☆12Updated 4 years ago
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2☆90Updated 3 weeks ago
- TTS models for Arabic (Tacotron2, FastPitch)☆119Updated 10 months ago
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://…☆12Updated 3 years ago
- A pipeline for transliteration, spell correction, POS tagging and word sense disambiguation of Hinglish code mixed data to Hindi Devanaga…☆35Updated last year
- We'll look into audio categorization using deep learning principles like Artificial Neural Networks (ANN), 1D Convolutional Neural Networ…☆48Updated 3 years ago
- This repository contains the training codes of the fine-tuned SpeechT5 on a Turkish dataset.☆19Updated last year
- Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech☆92Updated last year