Repository for the English-Hindi Codemixed to Monolingual English Parallel Corpus
☆13Feb 17, 2019Updated 7 years ago
Alternatives and similar repositories for en-hi-codemixed-corpus
Users that are interested in en-hi-codemixed-corpus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains all resources corresponding to the various TechX sessions at IIIT Hyderabad☆20Dec 12, 2018Updated 7 years ago
- Language identification and normalisation in code switching data tailored with a three-step decoding process☆24Dec 23, 2019Updated 6 years ago
- Hindi-English Transliteration Using sequence to sequence learning☆17Apr 3, 2017Updated 9 years ago
- Xlit-Crowd: Hindi-English Transliteration Corpus☆38Feb 17, 2015Updated 11 years ago
- Pre-trained, multilingual sequence-to-sequence models for Indian languages☆51Jul 20, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalenc…☆60Jul 30, 2024Updated last year
- A benchmark for code-switched NLP, ACL 2020☆76May 28, 2024Updated last year
- ☆34Nov 29, 2016Updated 9 years ago
- It is a simple tool to convert roman script to indic(Devanagari) script. As most Keyboards are English and to write in Indic script is di…☆13Aug 31, 2016Updated 9 years ago
- Tooling to play around with multilingual machine translation for Indian Languages.☆22Mar 5, 2022Updated 4 years ago
- Multilingual Neural Machine Translation using Transformers with Conditional Normalization.☆18Mar 24, 2023Updated 3 years ago
- POS tagging models for Hindi English Code Mixed Tweets☆11Aug 1, 2018Updated 7 years ago
- Improving cross-lingual word embeddings by meeting in the middle☆23Aug 25, 2020Updated 5 years ago
- Generate BERT vocabularies and pretraining examples from Wikipedias☆17May 11, 2020Updated 5 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Python library for converting UTF to WX and vice-versa for Indian languages.☆11Jan 17, 2019Updated 7 years ago
- Code and data for the EMNLP 2019 paper "In Plain Sight: Media Bias Through the Lens of Factual Reporting"☆10Feb 15, 2022Updated 4 years ago
- ☆16Oct 12, 2020Updated 5 years ago
- [EACL'23] MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages☆23Feb 13, 2023Updated 3 years ago
- Japanese--Russian--English News Commentary Parallel Data☆18Jul 9, 2019Updated 6 years ago
- Source Code for "Improved Embeddings for Learning Prerequisite Chains" (CPSC 490 - Senior Project)☆11May 2, 2019Updated 6 years ago
- Which ML are you?☆13Jan 3, 2023Updated 3 years ago
- Sequence tagger based on BERT☆20Apr 28, 2022Updated 3 years ago
- Curated list of publicly available parallel corpus for Indian Languages☆37Jul 15, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Material for introduction to Deep Learning Tutorials, Summer '20, '21☆16Jul 2, 2021Updated 4 years ago
- Source code and data for the paper "Towards String-to-Tree Neural Machine Translation"☆16Dec 31, 2017Updated 8 years ago
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation☆15Aug 27, 2024Updated last year
- Hinglish Text Classification☆30Jun 12, 2023Updated 2 years ago
- A pipeline for transliteration, spell correction, POS tagging and word sense disambiguation of Hinglish code mixed data to Hindi Devanaga…☆37Jan 14, 2024Updated 2 years ago
- An NMT framework built on Joint Representation☆12Feb 19, 2020Updated 6 years ago
- Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.☆10Aug 13, 2023Updated 2 years ago
- SEMEVAL 2020 TASK 11 "DETECTION OF PROPAGANDA TECHNIQUES IN NEWS ARTICLES"☆22Jun 12, 2023Updated 2 years ago
- Official repository for ACM Multimedia'23 paper "MATK: The Meme Analytical Tool Kit"☆13May 29, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A collaborative catalog of NLP resources for Indic languages☆630Dec 14, 2024Updated last year
- A curated list of papers & resources linked to concept learning☆12Aug 9, 2023Updated 2 years ago
- Machine Translation from English to Odia language.☆10Aug 9, 2021Updated 4 years ago
- ☆12Jan 21, 2019Updated 7 years ago
- a very fast parser for sparse matrix at libsvm format☆10Nov 13, 2017Updated 8 years ago
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆14Nov 4, 2023Updated 2 years ago
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13May 25, 2023Updated 2 years ago