AI4Bharat / IndicConformerASRLinks
☆46Updated 5 months ago
Alternatives and similar repositories for IndicConformerASR
Users that are interested in IndicConformerASR are comparing it to the libraries listed below
Sorting:
- Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving qu…☆49Updated this week
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR☆64Updated 5 months ago
- ☆65Updated last month
- ☆314Updated last year
- Text-to-Speech for languages of India☆297Updated last year
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆44Updated 2 months ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆351Updated 2 years ago
- A python package for whisper normalizer☆70Updated last month
- Finetune VITS and MMS using HuggingFace's tools☆176Updated last year
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆103Updated 3 months ago
- ☆44Updated 2 years ago
- Joint speech-language model - respond directly to audio!☆371Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆217Updated last year
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM☆290Updated 6 months ago
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2☆98Updated 2 months ago
- Indic-Conformer models for ASR☆18Updated last year
- ☆48Updated 2 years ago
- Create an LJSpeech structured voice dataset on wave input☆37Updated last year
- This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archi…☆207Updated 7 months ago
- ☆59Updated 4 months ago
- Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech☆92Updated 2 years ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆214Updated 6 months ago
- A Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS☆51Updated 11 months ago
- Zero-shot Audio Classification using Whisper☆78Updated 2 years ago
- ☆177Updated 11 months ago
- Vaksanca introduces free Sanskrit speech corpus with vowel segmentation.☆15Updated 4 years ago
- Efficient approach to speaker diarization using voice characteristics extraction☆104Updated 5 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆68Updated last month
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆71Updated 4 months ago
- Fine Tune the Style-TTS2 Voice Model☆261Updated 5 months ago