Repository for the English-Hindi Codemixed to Monolingual English Parallel Corpus
☆13Feb 17, 2019Updated 7 years ago
Alternatives and similar repositories for en-hi-codemixed-corpus
Users that are interested in en-hi-codemixed-corpus are comparing it to the libraries listed below
Sorting:
- This repository contains all resources corresponding to the various TechX sessions at IIIT Hyderabad☆20Dec 12, 2018Updated 7 years ago
- Language identification and normalisation in code switching data tailored with a three-step decoding process☆24Dec 23, 2019Updated 6 years ago
- Hindi-English Transliteration Using sequence to sequence learning☆17Apr 3, 2017Updated 8 years ago
- ☆10Aug 1, 2018Updated 7 years ago
- Geometry-aware Multilingual Embeddings☆26Dec 8, 2022Updated 3 years ago
- Language Identification and transliteration tool for Indian language code mixed data.☆24Feb 29, 2016Updated 10 years ago
- Pre-trained, multilingual sequence-to-sequence models for Indian languages☆51Jul 20, 2022Updated 3 years ago
- Xlit-Crowd: Hindi-English Transliteration Corpus☆38Feb 17, 2015Updated 11 years ago
- POS tagging models for Hindi English Code Mixed Tweets☆11Aug 1, 2018Updated 7 years ago
- A benchmark for code-switched NLP, ACL 2020☆76May 28, 2024Updated last year
- EMNLP 2020: Filtering before Iteratively Referring for Knowledge-Grounded Response Selection in Retrieval-Based Chatbots☆12Dec 15, 2020Updated 5 years ago
- This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalenc…☆58Jul 30, 2024Updated last year
- Multilingual Neural Machine Translation using Transformers with Conditional Normalization.☆18Mar 24, 2023Updated 2 years ago
- It is a simple tool to convert roman script to indic(Devanagari) script. As most Keyboards are English and to write in Indic script is di…☆13Aug 31, 2016Updated 9 years ago
- Generate BERT vocabularies and pretraining examples from Wikipedias☆17May 11, 2020Updated 5 years ago
- Code for the paper "Improving Robustness of Machine Translation with Synthetic Noise"☆21Dec 23, 2019Updated 6 years ago
- Improving cross-lingual word embeddings by meeting in the middle☆23Aug 25, 2020Updated 5 years ago
- Tooling to play around with multilingual machine translation for Indian Languages.☆22Mar 5, 2022Updated 4 years ago
- Repository of SMAI homeworks, Monsoon 2019-20.☆19Dec 7, 2019Updated 6 years ago
- Source code for "Improving Robustness of Neural Machine Translation with Multi-task Learning"☆19Aug 15, 2019Updated 6 years ago
- NumPy - PyTorch - TensorFlow (+Keras)☆23Oct 19, 2020Updated 5 years ago
- Open Source Python SDK for AI Agents Identity☆34Jan 20, 2026Updated last month
- "We must know. We shall know." - David Hilbert☆21Sep 8, 2025Updated 6 months ago
- Archive of my notes taken at lectures in IIITH☆25Aug 15, 2021Updated 4 years ago
- Curated list of publicly available parallel corpus for Indian Languages☆37Jul 15, 2021Updated 4 years ago
- This is the Javascript Code, it helps you to find you visited your Facebook Profile.☆12Sep 13, 2018Updated 7 years ago
- Official repository for ACM Multimedia'23 paper "MATK: The Meme Analytical Tool Kit"☆13May 29, 2024Updated last year
- 🕵 Given a user query this python module will returns a list of related searches you see on Google search results pages.☆11Sep 28, 2018Updated 7 years ago
- A parallel evaluation data set of SAP software documentation with document structure annotation☆14Jul 30, 2025Updated 7 months ago
- Create images and texts with the First Order Generative Adversarial Networks arxiv.org/abs/1802.04591☆35Feb 14, 2018Updated 8 years ago
- Source Code for "Improved Embeddings for Learning Prerequisite Chains" (CPSC 490 - Senior Project)☆11May 2, 2019Updated 6 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆46Jan 25, 2019Updated 7 years ago
- [AAAI 2021 Workshop] The official repository for the LST-MAP model for few-shot image classification.☆13Feb 12, 2021Updated 5 years ago
- ☆10Nov 9, 2020Updated 5 years ago
- A collaborative catalog of NLP resources for Indic languages☆627Dec 14, 2024Updated last year
- An R package for the Latent Environmental & Genetic InTeraction (LEGIT) model☆11Feb 11, 2021Updated 5 years ago
- Executable script for pony voice synthesis project☆11Jun 21, 2022Updated 3 years ago
- Image and video processing toolbox☆10Jun 12, 2020Updated 5 years ago
- Python bindings for CityHash☆10Nov 7, 2025Updated 4 months ago