AI4Bharat / vistaarLinks

Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR

☆69

Alternatives and similar repositories for vistaar

Users that are interested in vistaar are comparing it to the libraries listed below

Sorting:

AI4Bharat / IndicWav2Vec
Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2
☆101Updated 4 months ago
Open-Speech-EkStep / ULCA-asr-dataset-corpus
☆49Updated 3 years ago
kurianbenoy / whisper_normalizer
A python package for whisper normalizer
☆71Updated 2 months ago
Open-Speech-EkStep / indic-punct
☆45Updated 3 years ago
AI4Bharat / NPTEL2020-Indian-English-Speech-Dataset
NPTEL2020: Speech2Text dataset for Indian-English Accent
☆79Updated 4 years ago
jasonppy / PromptingWhisper
Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation
☆151Updated last year
Open-Speech-EkStep / vakyansh-wav2vec2-experimentation
Repository containing experimentation platform on how to train, infer on wav2vec2 models.
☆87Updated 3 years ago
AI4Bharat / Svarah
Swarah: Indian-English speech dataset collected across the country
☆37Updated 5 months ago
muskang48 / Speaker-Diarization
This project is about performing Speaker diarization for Hindi Language.
☆58Updated 4 years ago
Open-Speech-EkStep / data-acquisition-pipeline
☆17Updated 4 years ago
Open-Speech-EkStep / vakyansh-tts
Text to Speech for Indic languages
☆52Updated 3 years ago
FrenchKrab / IS2023-powerset-diarization
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
☆92Updated 2 years ago
gokulkarthik / text2speech
Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023
☆55Updated 2 years ago
ylacombe / finetune-hf-vits
Finetune VITS and MMS using HuggingFace's tools
☆184Updated last year
revdotcom / speech-datasets
Various speech datasets made available to the public
☆130Updated last year
HHousen / speaker-change-detection
Speaker change detection using SincNet and an LSTM/Transformer
☆56Updated 7 months ago
AdroitAnandAI / Indian-Accent-Speech-Recognition
Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech
☆92Updated 2 years ago
ErikEkstedt / VoiceActivityProjection
Voice Activity Projection Models: Self-supervised learning of Turn-taking Events
☆89Updated last year
crazycloud / mispronunciation-detection-diagnosis-wav2vec2-and-llm
Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…
☆43Updated last year
skit-ai / speech-to-intent-dataset
Dataset Release for Intent Classification from Speech
☆48Updated 10 months ago
AI4Bharat / IndicVoices-R
A Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS
☆52Updated last year
mbzuai-nlp / ArTST
☆60Updated 5 months ago
neulab / AfricanVoices
Hosts text-to-speech corpus and speech synthesizers for African languages.
☆17Updated 2 years ago
vasistalodagala / whisper-finetune
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
☆356Updated 2 years ago
rendchevi / daisy-tts
🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition
☆14Updated last month
smtiitm / Fastspeech2_MFA
Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving qu…
☆15Updated last year
huggingface / open_asr_leaderboard
☆156Updated 3 weeks ago
huggingface / diarizers
☆319Updated last year
skit-ai / Map-Mix
The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…
☆18Updated 2 years ago
skit-ai / SpeechLLM
This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfac…
☆125Updated last year