saiful9379 / BanglaASR
Fine-tune Bangla ASR model which was trained Bangla Mozilla Common Voice Dataset
☆10Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for BanglaASR
- Transformer based Bangla Speech Recognition☆51Updated last year
- ☆15Updated 6 months ago
- Automatic Context Sensitive Spelling Correction for Bangla Text Using Bert and Levenstein Distance☆20Updated this week
- Bangla Unicode Normalization☆17Updated 5 months ago
- Bangla Machine Translator☆43Updated 2 years ago
- Bangla text to speech, Multilingual (Bangla, English) real-time ([almost] in a GPU) speech synthesis library☆87Updated last month
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages☆90Updated last month
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://…☆11Updated 2 years ago
- Aiming to achieve ultimate Multilingual TTS pipeline with main focus on releasing COQUI🐸TTS(Text-to-Speech) based high performing neural…☆35Updated last year
- The official implementation of CATT Arabic diacritization models.☆35Updated 3 months ago
- [Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages☆10Updated 3 months ago
- speech recognition using Kaldi framework☆12Updated 4 years ago
- A collection of Bangla newspaper and blog crawlers. Can be used to mine bangla text data for Natural Language Processing tasks.☆14Updated last year
- Fine tuned llama 3 models for context based question answering in bengali language.☆10Updated last month
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023☆47Updated last year
- Into the depths of some concepts of Artificial Intelligence and Machine Learning☆10Updated 4 months ago
- The aim of the project was to convert an image to speech. An image is processed and segmented to identify the text in the image. Then the…☆12Updated 6 years ago
- Speech Enhancement: Tensorflow 2.x implementation of the stacked dual-signal transformation LSTM network (DTLN) for Noise Suppression.☆15Updated 3 years ago
- Fully Configurable RAG Pipeline for Bengali Language RAG Applications. Supports both Local and Huggingface Models, Built with Langchain.☆37Updated 3 months ago
- Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech☆91Updated last year
- This python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batc…☆35Updated 6 months ago
- Speech Emotion Detection using SVM, Decision Tree, Random Forest, MLP, CNN with different architectures☆30Updated 10 months ago
- This project is about performing Speaker diarization for Hindi Language.☆45Updated 3 years ago
- Project Made during Virtual Summer Internship under leadingindia.ai and BENNETT UNIVERSITY.☆91Updated last year
- Speech Emotion Recognition☆41Updated last year
- 🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.☆18Updated this week
- Bangla-Bert is a pretrained bert model for Bengali language☆76Updated last year
- This repository contains the official release of the model "BanglaT5" and associated downstream finetuning code and datasets introduced i…☆83Updated last year
- Identify a voice as male or female.☆34Updated 7 years ago
- AI tool that generates an Audio short story based on the context of an uploaded image by prompting a GenAI LLM model, Hugging Face AI mod…☆21Updated 10 months ago