Open-Speech-EkStep/indic-punct

Open-Speech-EkStep / indic-punct

☆45

Alternatives and similar repositories for indic-punct

Users who are interested in indic-punct are comparing it to the repositories listed below.

Open-Speech-EkStep / data-acquisition-pipeline
☆18 · Apr 28, 2021 · Updated 5 years ago
AI4Bharat / webcorpus
Generate large textual corpora for almost any language by crawling the web
☆13 · Feb 17, 2024 · Updated 2 years ago
skit-ai / slu-prosody
Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…
☆27 · May 17, 2023 · Updated 3 years ago
amazon-science / proteno
This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…
☆45 · May 25, 2021 · Updated 5 years ago
Open-Speech-EkStep / audio-to-speech-pipeline
This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline
☆33 · Feb 15, 2023 · Updated 3 years ago
Speech-Lab-IITM / Hindi-ASR-Challenge
🎯 Speech Recognition Challenge by Speech Lab - IIT Madras
☆10 · Nov 5, 2020 · Updated 5 years ago
Open-Speech-EkStep / vakyansh-models
Open source speech to text models for Indic Languages
☆327 · Sep 16, 2022 · Updated 3 years ago
Open-Speech-EkStep / vakyansh-tts
Text to Speech for Indic languages
☆53 · Mar 23, 2022 · Updated 4 years ago
Open-Speech-EkStep / speech-recognition-open-api
☆13 · Dec 15, 2022 · Updated 3 years ago
Open-Speech-EkStep / vakyansh-wav2vec2-experimentation
Repository containing experimentation platform on how to train, infer on wav2vec2 models.
☆89 · Sep 22, 2022 · Updated 3 years ago
AI4Bharat / indic-numtowords
A simple lightweight library for text normalization for Indian Languages
☆18 · Sep 30, 2025 · Updated 9 months ago
raj-sutariya / indic-num2words
Python library for converting numbers to words for all Indian Languages.
☆38 · May 23, 2025 · Updated last year
Prem-kumar27 / Fast-KTSpeechCrawler
Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler
☆23 · Mar 21, 2021 · Updated 5 years ago
robmsmt / SpeechLoop
Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?
☆19 · Oct 5, 2022 · Updated 3 years ago
skit-ai / emotion-tts-dataset
Dataset release for Emotional TTS in Indian Accent
☆41 · Mar 25, 2026 · Updated 3 months ago
gpu-poor / gramvaani_hindi_asr
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16 · Mar 26, 2022 · Updated 4 years ago
EZ-VC / EZ-VC
[EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion
☆43 · Sep 9, 2025 · Updated 10 months ago
AI4Bharat / IndicWav2Vec
Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2
☆117 · Aug 28, 2025 · Updated 10 months ago
desh2608 / kaldi-noise-vectors
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13 · Feb 13, 2021 · Updated 5 years ago
AI4Bharat / DocSim
Synthetically generate random text document images with ground-truth
☆14 · Jul 20, 2021 · Updated 5 years ago
Open-Speech-EkStep / crowdsource-dataplatform
This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…
☆17 · Mar 6, 2023 · Updated 3 years ago
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
Using YouTube to prepare a speech recognition dataset for any language
☆10 · Mar 30, 2021 · Updated 5 years ago
CUNY-CL / wikipron-modeling
Proposed splits for the LREC Wikipron paper
☆15 · Apr 7, 2020 · Updated 6 years ago
maveryn / punctuation-restoration
[W-NUT'20] Punctuation Restoration using Transformer Models for High-and Low-Resource Languages
☆227 · Jul 29, 2024 · Updated last year
RuABraun / texterrors
☆37 · Jun 9, 2026 · Updated last month
EndlessReform / smoltts
Open TTS models, built for streaming on the edge
☆45 · Mar 16, 2025 · Updated last year
awasthiabhijeet / Error-Driven-ASR-Personalization
Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021
☆11 · Jun 13, 2021 · Updated 5 years ago
tincans-ai / gazelle-inference
proof of concept conversation orchestrator with a speech-language model
☆20 · Oct 19, 2024 · Updated last year
Felflare / rpunct
📝An easy-to-use package to restore punctuation of the text.
☆120 · Apr 5, 2023 · Updated 3 years ago
NVIDIA / NeMo-text-processing
NeMo text processing for ASR and TTS
☆484 · Jul 16, 2026 · Updated last week
AI4Bharat / FBI
FBI: Finding Blindspots in LLM Evaluations with Interpretable Checklists
☆31 · Aug 14, 2025 · Updated 11 months ago
talhanai / kaldi-diar-latte
steps to perform text-based speaker diarization with kaldi toolkit
☆12 · Nov 2, 2018 · Updated 7 years ago
jagabandhumishra / IEEE-Summer-School
☆11 · Aug 3, 2021 · Updated 4 years ago
AI4Bharat / IndicMFA
☆18 · Sep 13, 2024 · Updated last year
MahirMahbub / Contextual-Spell-Checker-For-Bangla
Automatic Context Sensitive Spelling Correction for Bangla Text Using Bert and Levenstein Distance
☆21 · Nov 18, 2024 · Updated last year
skit-ai / Map-Mix
The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…
☆18 · Feb 17, 2023 · Updated 3 years ago
applicaai / pyramidions
This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…
☆14 · May 15, 2022 · Updated 4 years ago
mguner / audio_search
Use speech_to_text for keyword search in audio files.
☆12 · May 5, 2021 · Updated 5 years ago
AlanBaade / SyllableLM
Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models
☆63 · Jul 1, 2025 · Updated last year