AI4Bharat / indic-asr-api-backend
Indic-Conformer models for ASR
β15Updated 4 months ago
Related projects β
Alternatives and complementary repositories for indic-asr-api-backend
- π― Speech Recognition Challenge by Speech Lab - IIT Madrasβ11Updated 4 years ago
- A python package for whisper normalizerβ44Updated 4 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.β27Updated 9 months ago
- A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.β13Updated 6 months ago
- English ASR Challenge organized by Speech Lab, IIT Madrasβ11Updated 3 years ago
- β40Updated last year
- β9Updated last month
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipelineβ32Updated last year
- babyLM WhisBERT codeβ17Updated 5 months ago
- This repository contains text-to-speech (TTS) models and utilities designed produce synthetic training datasets for other speech-related β¦β14Updated last year
- Collection of scripts from mHuBERT-147.β22Updated this week
- Using YouTube to prepare a speech recognition dataset for any languageβ10Updated 3 years ago
- Audio tokenization, in the fastest way possible!β45Updated 2 months ago
- β16Updated 3 years ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.β16Updated 3 weeks ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic biasβ21Updated 4 years ago
- Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speechβ91Updated last year
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to sβ¦β27Updated last year
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.β12Updated this week
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription usingβ¦β28Updated last year
- β11Updated 3 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.β13Updated last year
- β28Updated 2 weeks ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawlerβ23Updated 3 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challengeβ14Updated 2 years ago
- β11Updated last year
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech represβ¦β16Updated 8 months ago
- Speaker diarization serviceβ19Updated this week
- β14Updated last year
- β11Updated last year