AI4Bharat / ShoonyaView external linksLinks
Shoonya - Platform to Annotate and label data at scale.
☆64Oct 31, 2025Updated 3 months ago
Alternatives and similar repositories for Shoonya
Users that are interested in Shoonya are comparing it to the libraries listed below
Sorting:
- Lisp dialect designed for HPC and AI☆26Updated this week
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- Sthaan uses AI to create digital addresses with local language support in voice/text, making it easier for people to find and reach locat…☆12Nov 17, 2024Updated last year
- A collaborative catalog of NLP resources for Indic languages☆628Dec 14, 2024Updated last year
- Font style transfer for Devanāgarī script using GANs☆12Jun 25, 2022Updated 3 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 5, 2025Updated last year
- Indic-Conformer models for ASR☆20Jul 19, 2024Updated last year
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.☆31May 22, 2025Updated 8 months ago
- ☆23Jun 5, 2025Updated 8 months ago
- Forced alignment decoder for Whisper.☆14Mar 13, 2024Updated last year
- indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2☆136Jan 2, 2024Updated 2 years ago
- proof of concept conversation orchestrator with a speech-language model☆20Oct 19, 2024Updated last year
- ☆18Jan 23, 2026Updated 3 weeks ago
- DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fast…☆51Feb 8, 2026Updated last week
- An LLM enabled XML generator for Indian laws in the LegalDocML and LegalRuleML formats☆19Sep 6, 2024Updated last year
- a map of goodreads☆40Nov 30, 2025Updated 2 months ago
- This repository contains the HiNER dataset released with our paper at LREC 2022☆16Jun 6, 2023Updated 2 years ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Jun 1, 2023Updated 2 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆21Jul 26, 2021Updated 4 years ago
- Audio tokenization, in the fastest way possible!☆53Aug 26, 2024Updated last year
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis☆36Dec 24, 2025Updated last month
- ☆18Sep 19, 2023Updated 2 years ago
- Resources and tools for Indian language Natural Language Processing☆627Jun 7, 2024Updated last year
- Add Siri like Native AI Agents in you App.☆54Jan 18, 2025Updated last year
- a Frontier Japanese Speech Generation net☆60May 15, 2025Updated 9 months ago
- The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the datase…☆205May 27, 2020Updated 5 years ago
- Cog is a tool for comparing languages using lexicostatistics and comparative linguistics techniques.☆24Oct 13, 2023Updated 2 years ago
- Temporary anonymous version☆22Mar 20, 2024Updated last year
- A PyTorch implementation of Parametric UMAP (Uniform Manifold Approximation and Projection) for learning low-dimensional parametric embed…☆33Mar 4, 2025Updated 11 months ago
- Because sometimes, the real API just won't cut it.☆12Jul 15, 2024Updated last year
- A universal messaging library for cross-platform applications (Chrome extension, Web, Mobile, Iframe,...)☆15Oct 10, 2025Updated 4 months ago
- ☆12Jan 17, 2026Updated 3 weeks ago
- A python package for whisper normalizer☆75Oct 6, 2025Updated 4 months ago
- This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data…☆27Oct 20, 2022Updated 3 years ago
- PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq.☆25Oct 3, 2022Updated 3 years ago
- Transcribe your videos and translate it into Indic languages.☆31Jan 30, 2026Updated 2 weeks ago
- CMU multilingual speech repository☆30Apr 15, 2022Updated 3 years ago
- faster inference☆28Jan 20, 2025Updated last year
- it's a train acoustics model code lib☆27May 20, 2020Updated 5 years ago