microsoft / GLUECoSLinks
A benchmark for code-switched NLP, ACL 2020
☆75Updated last year
Alternatives and similar repositories for GLUECoS
Users that are interested in GLUECoS are comparing it to the libraries listed below
Sorting:
- Code to reproduce the experiments from the paper.☆101Updated last year
- Code and Data for Evaluation WG☆42Updated 3 years ago
- QED: A Framework and Dataset for Explanations in Question Answering☆117Updated 4 years ago
- Interactive Neural Machine Translation tool☆53Updated 2 years ago
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020…☆33Updated 4 years ago
- ☆199Updated 3 years ago
- This repositary hosts my experiments for the project, I did with OffNote Labs.☆10Updated 4 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆56Updated 2 years ago
- Repository for the English-Hindi Codemixed to Monolingual English Parallel Corpus☆13Updated 6 years ago
- Exploring the Limits of Low-Resource Neural Machine Translation☆34Updated 2 years ago
- Question-answers, collected from Google☆129Updated 4 years ago
- A BART version of an open-domain QA model in a closed-book setup☆119Updated 4 years ago
- Yet Another Neural Machine Translation Toolkit☆179Updated 5 months ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Updated 3 years ago
- ☆75Updated 4 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆155Updated last year
- Dataset of ML and NLP papers☆34Updated 2 years ago
- New dataset☆306Updated 3 years ago
- Viewer for the 🤗 datasets library.☆84Updated 4 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆361Updated 3 years ago
- The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the datase…☆195Updated 5 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆94Updated 4 months ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 3 years ago
- Pre-trained, multilingual sequence-to-sequence models for Indian languages☆49Updated 3 years ago
- Codebase for probing and visualizing multilingual models.☆49Updated 5 years ago
- State of the art Semantic Sentence Embeddings☆99Updated 3 years ago
- Fine-tune transformers with pytorch-lightning☆44Updated 3 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆99Updated 2 years ago
- Curated list of publicly available parallel corpus for Indian Languages☆33Updated 4 years ago
- Build a dialog dataset from online books in many languages☆76Updated 2 years ago