AI4Bharat / indic-asr-api-backend
Indic-Conformer models for ASR
★17 Updated 8 months ago
Alternatives and similar repositories for indic-asr-api-backend:
Users who are interested in indic-asr-api-backend are comparing it to the libraries listed below.
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras ★11 Updated 4 years ago
- English ASR Challenge organized by Speech Lab, IIT Madras ★11 Updated 4 years ago
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Ai… ★9 Updated 6 months ago
- A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund. ★13 Updated 11 months ago
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR ★49 Updated 9 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ★27 Updated last year
- This repository contains text-to-speech (TTS) models and utilities designed to produce synthetic training datasets for other speech-related … ★19 Updated 2 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts ★14 Updated 4 months ago
- ★24 Updated last week
- ★17 Updated 3 years ago
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement. ★12 Updated 5 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ★13 Updated last year
- Using YouTube to prepare a speech recognition dataset for any language ★10 Updated 4 years ago
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini… ★12 Updated last year
- a Neural Vocoder supporting Ring Attention, Conformer and NSF. ★17 Updated last month
- ★12 Updated 2 months ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler ★23 Updated 4 years ago
- ★14 Updated 8 months ago
- A semi-supervised sequence-to-sequence ASR ★10 Updated 2 years ago
- ★10 Updated last month
- A python package for whisper normalizer ★53 Updated last month
- Swarah: Indian-English speech dataset collected across the country ★29 Updated last year
- Collection of scripts from mHuBERT-147. ★24 Updated 4 months ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni Hindi ASR challenge ★14 Updated 3 years ago
- Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech ★92 Updated last year
- Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language. ★10 Updated 2 months ago
- ★9 Updated 5 years ago
- 'Grad-TTS' with Multilingual Cleaners ★10 Updated last year
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories. ★21 Updated 3 weeks ago
- ★11 Updated 9 years ago