aswinpradeep / malayalam-asr-datasets
This repository contains various Malayalam ASR resources curated from multiple sources
☆17Updated 3 years ago
Alternatives and similar repositories for malayalam-asr-datasets
Users that are interested in malayalam-asr-datasets are comparing it to the libraries listed below
- State of the Art Language models and Classifier for Malayalam, which is spoken by the Malayali people in the Indian state of Kerala and t…☆38Updated 4 years ago
- Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech☆92Updated last year
- Text to Speech for Indic languages☆50Updated 3 years ago
- indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2☆127Updated last year
- Transcribe your videos and translate them into Indic languages.☆30Updated last week
- ☆49Updated 5 years ago
- Describes the IndicNLP corpus and associated datasets☆172Updated 2 years ago
- Speech Recognition for Indic languages.☆13Updated 4 years ago
- Generate large textual corpora for almost any language by crawling the web☆12Updated last year
- A Python package for whisper normalizer☆58Updated this week
- A pipeline for transliteration, spell correction, POS tagging and word sense disambiguation of Hinglish code mixed data to Hindi Devanaga…☆36Updated last year
- Open source subtitling platform 💻 for transcribing and translating videos/audios in Indic languages.☆88Updated last month
- COVID question-answering datasets and fine-tuned models☆19Updated 4 years ago
- Codebase for Indic-Transliteration using Seq2Seq RNN. For latest repo with Transformer-based models, check: https://github.com/AI4Bharat/…☆60Updated 3 years ago
- Dataset of sentences from Hindi stories tagged with different emotion tags☆11Updated 5 years ago
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023☆54Updated 2 years ago
- Do everything from collecting data from Reddit to training a machine learning model in just two lines of Python code!☆82Updated last year
- A Continually LoRA PreTrained and FineTuned 7B Llama-2 Indic model for Malayalam Language.☆60Updated 9 months ago
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2☆86Updated last year
- Transliteration models for 21 Indic languages☆89Updated last year
- Indian Language Tagger and Chunker (Hindi, Telugu, Tamil, Marathi, Punjabi, Kannada, Malayalam, Urdu, Bengali)☆42Updated 2 years ago
- Tutorial on English-to-Hindi Transliteration using Seq2Seq Architecture in TensorFlow☆16Updated 5 years ago
- ☆32Updated last year
- Hinglish Text Classification☆30Updated last year
- Pre-trained, multilingual sequence-to-sequence models for Indian languages☆47Updated 2 years ago
- Python library for converting numbers to words for all Indian Languages.☆35Updated 4 months ago
- Open source speech to text models for Indic Languages☆300Updated 2 years ago
- NPTEL2020: Speech2Text dataset for Indian-English Accent☆75Updated 3 years ago
- State of the Art Language models and Classifier for Tamil language (spoken in India and a few other South Asian countries)☆53Updated 4 years ago
- Translation models for 22 scheduled languages of India☆312Updated last week