AI4Bharat / IndicConformerASR
☆24Updated last week
Alternatives and similar repositories for IndicConformerASR:
Users who are interested in IndicConformerASR are comparing it to the libraries listed below
- Indic-Conformer models for ASR☆17Updated 8 months ago
- Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving qu…☆24Updated 2 weeks ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆17Updated last month
- A Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS☆37Updated 4 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆61Updated this week
- C++ version of pyannote audio overlapped speech detection pipeline☆12Updated last year
- Caption, translate, and optionally record in real time "what you hear" from speakers and microphone. Never miss part of the conversation …☆16Updated last year
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline☆32Updated 2 years ago
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR☆49Updated 9 months ago
- a simple system for 2-way interruptible voice interactions between human and LLM☆25Updated last year
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 4 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…☆12Updated last year
- TensorFlow-based wake word detection☆12Updated 6 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆14Updated last month
- Open TTS models, built for streaming on the edge☆39Updated 3 weeks ago
- Joint speech-language model - respond directly to audio!☆30Updated 10 months ago
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Ai…☆9Updated 6 months ago
- English ASR Challenge organized by Speech Lab, IIT Madras☆11Updated 4 years ago
- A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.☆13Updated 11 months ago
- Accelerate Whisper tasks such as transcription, by multiprocessing through parallelization☆25Updated 2 years ago
- Russian phonetic transcription☆10Updated last year
- Speaker diarization service☆21Updated last week
- This repository contains text-to-speech (TTS) models and utilities designed to produce synthetic training datasets for other speech-related …☆19Updated 2 years ago