AI4Bharat / indic-asr-api-backend
Indic-Conformer models for ASR
☆17 Updated 11 months ago
Alternatives and similar repositories for indic-asr-api-backend
Users who are interested in indic-asr-api-backend are comparing it to the libraries listed below.
- This repository contains text-to-speech (TTS) models and utilities designed to produce synthetic training datasets for other speech-related … ☆19 Updated 2 years ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription ☆19 Updated last month
- Codebase for "Transcription free filler word detection with Neural semi-CRFs" [ICASSP2023] ☆8 Updated last year
- ☆14 Updated last year
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts ☆14 Updated 7 months ago
- MultiMed-ST: Large-scale Many-to-many Multilingual Medical Speech Translation ☆13 Updated 3 months ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a… ☆12 Updated 2 years ago
- Code for ICML25 Paper "Overcoming Non-monotonicity in Transducer-based Streaming Generation" ☆11 Updated last month
- A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund. ☆14 Updated last year
- A semi-supervised sequence-to-sequence ASR ☆10 Updated 2 years ago
- English ASR Challenge organized by Speech Lab, IIT Madras ☆11 Updated 4 years ago
- DysfluentWFST ☆13 Updated last month
- ☆12 Updated 5 months ago
- Dippy Synthetic Speech Subnet ☆16 Updated last month
- Russian phonetical transcription ☆10 Updated last year
- ☆11 Updated last year
- a Neural Vocoder supporting Ring Attention, Conformer and NSF. ☆19 Updated 5 months ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler ☆24 Updated 4 years ago
- Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language. ☆10 Updated 5 months ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras ☆11 Updated 4 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 Updated last year
- ☆13 Updated 10 months ago
- Zero-Shot Foreign Accent Conversion without a Native Reference ☆33 Updated last year
- ☆17 Updated 4 years ago
- ☆11 Updated 2 years ago
- ☆16 Updated 4 months ago
- ☆34 Updated 3 weeks ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 Updated 4 years ago
- A Model (maybe an app) that translates the audio of a video from one language to another language, cloning the voice of original video wi… ☆11 Updated last month
- Goodness of Pronunciation algorithm using PyKaldi ☆16 Updated 3 years ago