Xlit-Crowd: Hindi-English Transliteration Corpus
☆38 · Feb 17, 2015 · Updated 11 years ago
Alternatives and similar repositories for crowd-indic-transliteration-data
Users who are interested in crowd-indic-transliteration-data are comparing it to the libraries listed below:
- Hindi-English Transliteration Using sequence to sequence learning ☆17 · Apr 3, 2017 · Updated 8 years ago
- Submission for the Programming Task for the Precog Recruitment Process (II) ☆14 · Jul 29, 2016 · Updated 9 years ago
- Language Identification and transliteration tool for Indian language code mixed data. ☆24 · Feb 29, 2016 · Updated 10 years ago
- A collection of basic text processing modules focused on Gujarati ☆10 · Oct 24, 2017 · Updated 8 years ago
- POS tagging models for Hindi English Code Mixed Tweets ☆11 · Aug 1, 2018 · Updated 7 years ago
- The project aims to add a state-of-the-art transliteration module for cross transliterations among all Indian languages including Engl… ☆273 · Oct 28, 2022 · Updated 3 years ago
- A Hindi-English Dataset for Text Normalization ☆17 · Jan 3, 2022 · Updated 4 years ago
- It is a simple tool to convert Roman script to Indic (Devanagari) script. As most Keyboards are English and to write in Indic script is di… ☆13 · Aug 31, 2016 · Updated 9 years ago
- Funny text generation using character level LSTM model, featured in TED ideas ☆24 · Sep 21, 2018 · Updated 7 years ago
- [ONGOING] ACM ICPC Handbook for Algorithms and Data Structures ☆24 · Oct 25, 2020 · Updated 5 years ago
- Language identification and normalisation in code switching data tailored with a three-step decoding process ☆24 · Dec 23, 2019 · Updated 6 years ago
- This is a Python package which implements a tokenizer and stemmer for the Hindi language ☆95 · Oct 2, 2020 · Updated 5 years ago
- Codebase for Indic-Transliteration using Seq2Seq RNN. For latest repo with Transformer-based models, check: https://github.com/AI4Bharat/… ☆59 · Jul 9, 2021 · Updated 4 years ago
- Hinglish Text Classification ☆30 · Jun 12, 2023 · Updated 2 years ago
- A collaborative catalog of NLP resources for Indic languages ☆627 · Dec 14, 2024 · Updated last year
- The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the datase… ☆205 · May 27, 2020 · Updated 5 years ago
- Resources to go with the Indic NLP Library ☆78 · Jun 12, 2022 · Updated 3 years ago
- ☆30 · Nov 1, 2019 · Updated 6 years ago
- Clinical spelling correction with word and character n-gram embeddings. ☆75 · Jun 21, 2022 · Updated 3 years ago
- A parallel evaluation data set of SAP software documentation with document structure annotation ☆14 · Jul 30, 2025 · Updated 7 months ago
- Official repository for ACM Multimedia'23 paper "MATK: The Meme Analytical Tool Kit" ☆13 · May 29, 2024 · Updated last year
- Minimal event-driven framework for Java. ☆15 · Apr 14, 2019 · Updated 6 years ago
- This is the JavaScript Code, it helps you to find you visited your Facebook Profile. ☆12 · Sep 13, 2018 · Updated 7 years ago
- ULMFiT + Siamese Network for Sentence Vectors ☆33 · Oct 18, 2018 · Updated 7 years ago
- A Python package for cleaning Gutenberg books and dataset ☆34 · May 2, 2025 · Updated 10 months ago
- Tamil Language words list ☆12 · Jul 2, 2016 · Updated 9 years ago
- Implementation of a fast semantic chunker in C++, installable in Python 3.7+ projects. ☆22 · Sep 20, 2025 · Updated 5 months ago
- Documentation website for Happy Coder ☆18 · Jan 12, 2026 · Updated last month
- Resources for our IJCAI 2020 paper, TopicKA: Generating Commonsense Knowledge-Aware Dialogue Responses Towards the Recommended Topic Fact ☆12 · Nov 30, 2020 · Updated 5 years ago
- Tools and Examples for Computational Text Analysis for Assyriologists. ☆11 · Sep 3, 2018 · Updated 7 years ago
- A lightweight Application Framework for Python powered by Dependency Injection ☆20 · Dec 15, 2025 · Updated 2 months ago
- Automatic transliteration with LSTM ☆93 · Dec 7, 2018 · Updated 7 years ago
- Common lib like Apache Commons ☆11 · May 5, 2016 · Updated 9 years ago
- A semantic role labeling system for the Sumerian language. A Google Summer of Code '18 initiative. ☆16 · Feb 10, 2023 · Updated 3 years ago
- A curated list of papers & resources linked to concept learning ☆12 · Aug 9, 2023 · Updated 2 years ago
- karthikbmk's independent study ☆10 · Sep 2, 2017 · Updated 8 years ago
- Banyan is Trees for Python ☆12 · Jul 14, 2014 · Updated 11 years ago
- Java and Scala client libraries for Concord ☆13 · Feb 15, 2017 · Updated 9 years ago
- ☆10 · Aug 1, 2018 · Updated 7 years ago