AI4Bharat / indic-asr-api-backend
Indic-Conformer models for ASR
☆18 Updated last year
Alternatives and similar repositories for indic-asr-api-backend
Users that are interested in indic-asr-api-backend are comparing it to the libraries listed below
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras ☆11 Updated 4 years ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 Updated 4 years ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription ☆19 Updated this week
- ☆17 Updated 4 years ago
- This repository contains text-to-speech (TTS) models and utilities designed to produce synthetic training datasets for other speech-related … ☆21 Updated 2 years ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a… ☆12 Updated 2 years ago
- ☆14 Updated 2 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts ☆14 Updated 9 months ago
- Accelerate Whisper tasks such as transcription by parallelizing them through multiprocessing ☆25 Updated 2 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler ☆24 Updated 4 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated last year
- ☆11 Updated 3 years ago
- ☆13 Updated 10 years ago
- ☆11 Updated this week
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 Updated 2 years ago
- Dippy Synthetic Speech Subnet ☆17 Updated 3 weeks ago
- English ASR Challenge organized by Speech Lab, IIT Madras ☆11 Updated 4 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference ☆34 Updated last year
- Goodness of Pronunciation algorithm using PyKaldi ☆16 Updated 3 years ago
- ☆20 Updated last year
- ☆14 Updated last year
- Example workflow for our data-centric speech benchmark ☆17 Updated 2 years ago
- A neural vocoder supporting Ring Attention, Conformer and NSF. ☆20 Updated last month
- Enable RNNLM lattice rescoring with PyTorch [kaldi] ☆12 Updated 5 years ago
- Audio tokenization, in the fastest way possible! ☆52 Updated last year
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 Updated 2 years ago
- Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech ☆93 Updated last year
- Swarah: Indian-English speech dataset collected across the country ☆35 Updated 2 months ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s… ☆28 Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 Updated 2 years ago