Open source speech to text models for Indic Languages
☆325Sep 16, 2022Updated 3 years ago
Alternatives and similar repositories for vakyansh-models
Users that are interested in vakyansh-models are comparing it to the libraries listed below.
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆88Sep 22, 2022Updated 3 years ago
- Text to Speech for Indic languages☆52Mar 23, 2022Updated 4 years ago
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline☆32Feb 15, 2023Updated 3 years ago
- ☆45Dec 15, 2022Updated 3 years ago
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2☆110Aug 28, 2025Updated 6 months ago
- ☆13Dec 15, 2022Updated 3 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆10Nov 5, 2020Updated 5 years ago
- ☆18Apr 28, 2021Updated 4 years ago
- indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2☆137Jan 2, 2024Updated 2 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni Hindi ASR challenge☆15Mar 26, 2022Updated 4 years ago
- Dataset release for Emotional TTS in Indian Accent☆40Sep 2, 2022Updated 3 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆21Jul 26, 2021Updated 4 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Oct 12, 2022Updated 3 years ago
- State-Of-The-Art & ready to use mini NLP models for Indian Languages☆43May 22, 2021Updated 4 years ago
- Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and Indian-English. For latest Indic-BERT v2, check: https://github.c…☆292May 11, 2023Updated 2 years ago
- Text-to-Speech for languages of India☆345Nov 8, 2024Updated last year
- Pre-trained, multilingual sequence-to-sequence models for Indian languages☆51Jul 20, 2022Updated 3 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Mar 6, 2023Updated 3 years ago
- Translation models for 22 scheduled languages of India☆414Oct 3, 2025Updated 5 months ago
- A large scale Sanskrit-English translation dataset☆80Mar 20, 2023Updated 3 years ago
- Open Source Speech Inferencing Library for Indic Languages☆12Apr 11, 2022Updated 3 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- The project aims to add a state-of-the-art transliteration module for cross transliterations among all Indian languages including Engl…☆274Oct 28, 2022Updated 3 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆91Jan 11, 2022Updated 4 years ago
- Expressive TTS Dataset for Assamese, Bengali, and Tamil.☆15Mar 6, 2025Updated last year
- ☆23May 5, 2022Updated 3 years ago
- Generate large textual corpora for almost any language by crawling the web☆13Feb 17, 2024Updated 2 years ago
- Dataset Release for Intent Classification from Speech☆48Feb 23, 2025Updated last year
- Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer m…☆839Jan 20, 2024Updated 2 years ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Apr 1, 2021Updated 4 years ago
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…☆13Mar 18, 2024Updated 2 years ago
- End-to-end MOdeling of ASR (Automatic Speech Recognition)☆33Feb 16, 2023Updated 3 years ago
- Pretraining, fine-tuning and evaluation scripts for IndicBERT-v2 and IndicXTREME☆109Apr 6, 2025Updated 11 months ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Korean read speech corpus (about 120 hours, 17GB) from National Institute of Korean Language☆43Feb 28, 2018Updated 8 years ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆17Mar 11, 2022Updated 4 years ago
- NPTEL2020: Speech2Text dataset for Indian-English Accent☆83Dec 24, 2021Updated 4 years ago
- Official code for Wav2Seq☆97Jul 19, 2022Updated 3 years ago