AI4Bharat / indic-asr-api-backend
Indic-Conformer models for ASR
☆17Updated 10 months ago
Alternatives and similar repositories for indic-asr-api-backend
Users who are interested in indic-asr-api-backend are comparing it to the libraries listed below.
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆11Updated 4 years ago
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR☆56Updated last month
- English ASR Challenge organized by Speech Lab, IIT Madras☆11Updated 4 years ago
- ☆17Updated 4 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- A python package for whisper normalizer☆60Updated 3 weeks ago
- ☆11Updated 3 years ago
- Swarah: Indian-English speech dataset collected across the country☆33Updated 2 years ago
- A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.☆14Updated last year
- ☆31Updated 2 months ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 4 years ago
- ☆46Updated 2 years ago
- A neural vocoder supporting Ring Attention, Conformer and NSF.☆18Updated 3 months ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Ai…☆10Updated 7 months ago
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline☆32Updated 2 years ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 3 weeks ago
- Text to Speech for Indic languages☆51Updated 3 years ago
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2☆86Updated last year
- Dippy Synthetic Speech Subnet☆16Updated 2 weeks ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- Russian phonetical transcription☆10Updated last year
- Audio tokenization, in the fastest way possible!☆52Updated 9 months ago
- ☆12Updated 4 months ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Updated 4 years ago
- ☆15Updated 3 months ago
- ☆43Updated 2 years ago
- The Vokan Architecture (Tsukasa speech based)☆9Updated 3 months ago
- Final training script from the HuggingFace Whisper fine-tuning event, aimed at getting the best results from the fine-tuned model.☆12Updated 2 years ago
- This repo contains the baseline model recipes and pre-trained model for the GramVanni Hindi ASR challenge☆15Updated 3 years ago