Open source speech to text models for Indic Languages
☆328Sep 16, 2022Updated 3 years ago
Alternatives and similar repositories for vakyansh-models
Users that are interested in vakyansh-models are comparing it to the libraries listed below.
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆88Sep 22, 2022Updated 3 years ago
- Text to Speech for Indic languages☆52Mar 23, 2022Updated 4 years ago
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline☆32Feb 15, 2023Updated 3 years ago
- ☆45Dec 15, 2022Updated 3 years ago
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2☆111Aug 28, 2025Updated 8 months ago
- ☆13Dec 15, 2022Updated 3 years ago
- A collaborative catalog of NLP resources for Indic languages☆631Dec 14, 2024Updated last year
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆10Nov 5, 2020Updated 5 years ago
- ☆18Apr 28, 2021Updated 5 years ago
- indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2☆139Jan 2, 2024Updated 2 years ago
- This repo contains the baseline model recipes and pre-trained model for the GramVanni Hindi ASR challenge☆15Mar 26, 2022Updated 4 years ago
- Dataset release for Emotional TTS in Indian Accent☆40Mar 25, 2026Updated last month
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Oct 12, 2022Updated 3 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆22Jul 26, 2021Updated 4 years ago
- State-Of-The-Art & ready to use mini NLP models for Indian Languages☆43May 22, 2021Updated 4 years ago
- Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and Indian-English. For latest Indic-BERT v2, check: https://github.c…☆296May 11, 2023Updated 2 years ago
- Text-to-Speech for languages of India☆361Nov 8, 2024Updated last year
- Pre-trained, multilingual sequence-to-sequence models for Indian languages☆51Jul 20, 2022Updated 3 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Mar 6, 2023Updated 3 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- Translation models for 22 scheduled languages of India☆427Oct 3, 2025Updated 7 months ago
- A large scale Sanskrit-English translation dataset☆81Mar 20, 2023Updated 3 years ago
- Open Source Speech Inferencing Library for Indic Languages☆12Apr 11, 2022Updated 4 years ago
- Resources and tools for Indian language Natural Language Processing☆637Jun 7, 2024Updated last year
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆91Jan 11, 2022Updated 4 years ago
- The project aims to add a state-of-the-art transliteration module for cross transliterations among all Indian languages including Engl…☆276Oct 28, 2022Updated 3 years ago
- Expressive TTS Dataset for Assamese, Bengali, and Tamil.☆15Mar 6, 2025Updated last year
- ☆24May 5, 2022Updated 4 years ago
- Generate large textual corpora for almost any language by crawling the web☆13Feb 17, 2024Updated 2 years ago
- Dataset Release for Intent Classification from Speech☆48Feb 23, 2025Updated last year
- Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer m…☆840Jan 20, 2024Updated 2 years ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Apr 1, 2021Updated 5 years ago
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…☆13Mar 18, 2024Updated 2 years ago
- End-to-end MOdeling of ASR (Automatic Speech Recognition)☆33Feb 16, 2023Updated 3 years ago
- Pretraining, fine-tuning and evaluation scripts for IndicBERT-v2 and IndicXTREME☆113Apr 6, 2025Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Korean read speech corpus (about 120 hours, 17GB) from National Institute of Korean Language☆43Feb 28, 2018Updated 8 years ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆17Mar 11, 2022Updated 4 years ago