rodosingh / Intro-NLP-IIITHLinks
Course Materials (along with assignments) for Intro to NLP, done as a part for requirement of the course "Introduction to NLP" (course-code: CS7.401.S22) @ IIITH. Note: If you are cloning this or taking help of this repo, try to star the repo.
☆9Updated 2 years ago
Alternatives and similar repositories for Intro-NLP-IIITH
Users that are interested in Intro-NLP-IIITH are comparing it to the libraries listed below
Sorting:
- A pipeline for transliteration, spell correction, POS tagging and word sense disambiguation of Hinglish code mixed data to Hindi Devanaga…☆36Updated last year
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2☆87Updated last year
- indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2☆129Updated last year
- ☆16Updated last year
- ☆32Updated last year
- SemEval 2024 Task 1 : Textual Semantic Relatedness☆25Updated last year
- ☆98Updated 5 months ago
- Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentati…☆41Updated 2 years ago
- This repository is dedicated to development of code-mixed language resources.☆26Updated 2 years ago
- Text to Speech for Indic languages☆51Updated 3 years ago
- Hinglish Text Classification☆30Updated 2 years ago
- Pretraining, fine-tuning and evaluation scripts for IndicBERT-v2 and IndicXTREME☆100Updated 4 months ago
- Marathi NLP - is a repository dedicated to development of tools and resources for Marathi language.☆142Updated 2 months ago
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆55Updated 4 years ago
- Transcribe your videos and translate it into Indic languages.☆31Updated last week
- An assignment for CMU CS11-711 Advanced NLP, building NLP systems from scratch☆170Updated 2 years ago
- Description Describes the IndicNLP corpus and associated datasets☆177Updated 2 years ago
- ☆29Updated last year
- ☆18Updated 3 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆11Updated 4 years ago
- 📝 A not-so-fancy but still a pretty research CV☆84Updated 4 years ago
- ☆22Updated 2 years ago
- CaptionBot : Sequence to Sequence Modelling where Encoder is CNN(Resnet-50) and Decoder is LSTMCell with soft attention mechanism☆50Updated 3 years ago
- This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalenc…☆55Updated last year
- IndicGenBench is a high-quality, multilingual, multi-way parallel benchmark for evaluating Large Language Models (LLMs) on 4 user-facing …☆52Updated 11 months ago
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"☆60Updated 9 months ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆87Updated 2 years ago
- (Silver medal - 60th place - Top 3%) Repository for the "Tweet Sentiment Extraction" Kaggle competition.☆10Updated 5 years ago
- ☆15Updated 2 years ago
- Language identification and normalisation in code switching data tailored with a three-step decoding process☆24Updated 5 years ago