harmanpreet93 / low-resource-machine-translation
Low resource machine translation using Transformers and Iterative Back translation
☆10Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for low-resource-machine-translation
- Fine-tune transformers with pytorch-lightning☆44Updated 2 years ago
- A benchmark for code-switched NLP, ACL 2020☆74Updated 5 months ago
- QED: A Framework and Dataset for Explanations in Question Answering☆114Updated 3 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Updated 2 years ago
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020…☆32Updated 3 years ago
- This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalenc…☆52Updated 3 months ago
- Pre-trained, multilingual sequence-to-sequence models for Indian languages☆45Updated 2 years ago
- ☆46Updated 4 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆46Updated last year
- BERT, RoBERTa fine-tuning over SQuAD Dataset using pytorch-lightning⚡️, 🤗-transformers & 🤗-nlp.☆36Updated last year
- A web application that interfaces two GEC systems. [web instance is down]☆31Updated 3 months ago
- Material for the COLING 2020 Tutorial on Multilingual NMT☆16Updated 3 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆153Updated 10 months ago
- A Benchmark Dataset for Understanding Disfluencies in Question Answering☆60Updated 3 years ago
- On Generating Extended Summaries of Long Documents☆77Updated 3 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated last year
- The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)☆52Updated 2 years ago
- A crowdsourced dataset of dialogues grounded in social contexts involving utilization of commonsense.☆79Updated 3 years ago
- Training T5 to perform numerical reasoning.☆23Updated 3 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆82Updated 3 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Updated 2 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆46Updated 3 years ago
- ☆34Updated 4 years ago
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences☆62Updated 6 months ago
- Build a dialog dataset from online books in many languages☆71Updated 2 years ago
- A BART version of an open-domain QA model in a closed-book setup☆119Updated 4 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆119Updated 3 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆31Updated 2 years ago
- Dataset of sentences from Hindi stories tagged with different emotion tags☆10Updated 4 years ago
- ⛔ [NOT MAINTAINED] A web-based annotator for closed-domain question answering datasets with SQuAD format.☆88Updated last year