AI4Bharat/IndicWav2Vec

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AI4Bharat/IndicWav2Vec)

AI4Bharat / IndicWav2Vec

Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2

☆117

Alternatives and similar repositories for IndicWav2Vec

Users that are interested in IndicWav2Vec are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AI4Bharat / webcorpus
View on GitHub
Generate large textual corpora for almost any language by crawling the web
☆13Feb 17, 2024Updated 2 years ago
krylm / whisper-event-tuning
View on GitHub
Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.
☆12Dec 24, 2022Updated 3 years ago
AI4Bharat / indic-asr-api-backend
View on GitHub
Indic-Conformer models for ASR
☆19Jul 19, 2024Updated 2 years ago
Open-Speech-EkStep / vakyansh-models
View on GitHub
Open source speech to text models for Indic Languages
☆327Sep 16, 2022Updated 3 years ago
CoEDL / vad-sli-asr
View on GitHub
A pipeline to isolate and transcribe one language in mixed-language speech
☆20Oct 25, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
AI4Bharat / Indic-TTS
View on GitHub
Text-to-Speech for languages of India
☆378Nov 8, 2024Updated last year
Open-Speech-EkStep / indic-punct
View on GitHub
☆45Dec 15, 2022Updated 3 years ago
vectominist / MiniASR
View on GitHub
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
☆53Dec 6, 2022Updated 3 years ago
Open-Speech-EkStep / vakyansh-wav2vec2-experimentation
View on GitHub
Repository containing experimentation platform on how to train, infer on wav2vec2 models.
☆89Sep 22, 2022Updated 3 years ago
AI4Bharat / IndicTrans2
View on GitHub
Translation models for 22 scheduled languages of India
☆450Oct 3, 2025Updated 9 months ago
sarahjuan / iban
View on GitHub
☆14Jun 12, 2015Updated 11 years ago
skit-ai / slu-prosody
View on GitHub
Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…
☆27May 17, 2023Updated 3 years ago
Open-Speech-EkStep / vakyansh-tts
View on GitHub
Text to Speech for Indic languages
☆53Mar 23, 2022Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
skinahan / DIVA_PyTorch
View on GitHub
Implementation of the DIVA model of speech acquisition and production using PyTorch
☆23Jan 18, 2023Updated 3 years ago
Open-Speech-EkStep / audio-to-speech-pipeline
View on GitHub
This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline
☆33Feb 15, 2023Updated 3 years ago
skakouros / s3prl_attentive_correlation
View on GitHub
Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit
☆13Nov 18, 2022Updated 3 years ago
AI4Bharat / IndicBERT
View on GitHub
Pretraining, fine-tuning and evaluation scripts for IndicBERT-v2 and IndicXTREME
☆121Apr 6, 2025Updated last year
Open-Speech-EkStep / ULCA-asr-dataset-corpus
View on GitHub
☆50Nov 23, 2022Updated 3 years ago
raotnameh / End-to-end-E2E-Named-Entity-Recognition-from-English-Speech
View on GitHub
☆32Dec 2, 2020Updated 5 years ago
hbredin / pyannotebook
View on GitHub
🎹 pyannote + 🗒 notebook = pyannotebook
☆27Jun 12, 2023Updated 3 years ago
soumendrak / MTEnglish2Odia
View on GitHub
Machine Translation from English to Odia language.
☆10Aug 9, 2021Updated 4 years ago
GATECH-EIC / S3-Router
View on GitHub
[NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…
☆17Sep 19, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
msplabresearch / MSP-Podcast_Challenge_IS2025
View on GitHub
MSP-Podcast Challenge Baseline Code for Interspeech 2025
☆29Dec 4, 2024Updated last year
coryshain / dnnseg
View on GitHub
☆11Mar 20, 2021Updated 5 years ago
AI4Bharat / indicTrans
View on GitHub
indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2
☆141Jan 2, 2024Updated 2 years ago
AI4Bharat / FBI
View on GitHub
FBI: Finding Blindspots in LLM Evaluations with Interpretable Checklists
☆31Aug 14, 2025Updated 11 months ago
qcri / e-wer
View on GitHub
Word Error Rate Estimation
☆16Aug 25, 2020Updated 5 years ago
bayartsogt-ya / whisper-multiple-hf-datasets
View on GitHub
Whisper fine-tuning event script to use multiple hf datasets
☆32Dec 20, 2022Updated 3 years ago
AI4Bharat / IndicMFA
View on GitHub
☆18Sep 13, 2024Updated last year
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
vakyansh / gemma-experimentation
View on GitHub
Experimentation on google's gemma model
☆15Mar 6, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
MrEdwards007 / WhisperTaskAcceleration
View on GitHub
Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization
☆25Oct 29, 2022Updated 3 years ago
cyfer0618 / kaldi-pytorch-rnnlm
View on GitHub
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Jun 5, 2020Updated 6 years ago
alefiury / SE-R-2022-SER-Track
View on GitHub
Code for the winning solution in the SE&R 2022 Challenge - SER track.
☆16Mar 28, 2023Updated 3 years ago
AI4Bharat / DocSim
View on GitHub
Synthetically generate random text document images with ground-truth
☆14Jul 20, 2021Updated 5 years ago
yanghaha0908 / FastHuBERT
View on GitHub
Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning
☆100Nov 20, 2024Updated last year
smart-audio / audio_diarization_annotation
View on GitHub
Audio Diarization Annotation tool
☆30Nov 8, 2019Updated 6 years ago
skit-ai / Map-Mix
View on GitHub
The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…
☆18Feb 17, 2023Updated 3 years ago