raj-sutariya / indic-num2words
Python library for converting numbers to words for all Indian Languages.
☆34Updated 2 weeks ago
Alternatives and similar repositories for indic-num2words:
Users that are interested in indic-num2words are comparing it to the libraries listed below
- ☆42Updated 2 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆86Updated 2 years ago
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2☆82Updated 10 months ago
- ☆41Updated 2 years ago
- Text to Speech for Indic languages☆49Updated 2 years ago
- Support tools for punctuation and boundary detection for ASR output.☆57Updated 2 years ago
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆51Updated 4 years ago
- ☆14Updated 2 years ago
- Dataset Release for Intent Classification from Speech☆46Updated last year
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆20Updated 3 years ago
- Server framework for Kaldi ASR Toolkit☆97Updated last year
- indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2☆122Updated last year
- Improving Disfluency Detection by Self-Training a Self-Attentive Model☆47Updated 3 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 2 years ago
- Codebase for Indic-Transliteration using Seq2Seq RNN. For latest repo with Transformer-based models, check: https://github.com/AI4Bharat/…☆60Updated 3 years ago
- ☆17Updated 3 years ago
- Complimentary code for our paper Automatic punctuation restoration with BERT models☆48Updated last year
- NPTEL2020: Speech2Text dataset for Indian-English Accent☆71Updated 3 years ago
- Language identification and normalisation in code switching data tailored with a three-step decoding process☆24Updated 5 years ago
- ☆34Updated 4 months ago
- A Python based API to access Indian language WordNets.☆37Updated 2 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆91Updated 3 years ago
- A recipe for constituency parsing, disfluency tagging and obtaining the fluent transcripts of English Fisher dataset☆12Updated 3 years ago
- Indian Language Tagger and Chunker (Hindi, Telugu, Tamil, Marathi, Punjabi, Kanada, Malayalam, Urdu, Bengali)☆41Updated last year
- Code for extracting parallel corpora from pmindia☆16Updated 4 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆113Updated 5 years ago
- ☆42Updated 3 years ago
- Various speech datasets made available to the public☆107Updated last month
- A module for normalising text.☆173Updated 3 years ago
- The repository contains all the codes necessary for my project - Automatic Speech Recognition System in Hindi Language ( Project descript…☆28Updated 5 years ago