microsoft / GLUECoS
A benchmark for code-switched NLP, ACL 2020
☆75 Updated last year
Alternatives and similar repositories for GLUECoS
Users who are interested in GLUECoS are comparing it to the repositories listed below.
- QED: A Framework and Dataset for Explanations in Question Answering ☆117 Updated 4 years ago
- Code to reproduce the experiments from the paper. ☆101 Updated last year
- ☆203 Updated 3 years ago
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020… ☆33 Updated 4 years ago
- Dataset of ML and NLP papers ☆34 Updated 3 years ago
- Question-answers, collected from Google ☆128 Updated 4 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow. ☆95 Updated 6 months ago
- ☆75 Updated 4 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c… ☆362 Updated 3 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines ☆137 Updated 2 years ago
- Code and Data for Evaluation WG ☆42 Updated 3 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and … ☆314 Updated 5 years ago
- State of the art Semantic Sentence Embeddings ☆99 Updated 3 years ago
- Stanford's Alexa Prize socialbot ☆133 Updated 2 years ago
- Yet Another Neural Machine Translation Toolkit ☆178 Updated 7 months ago
- Codebase for probing and visualizing multilingual models. ☆49 Updated 5 years ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime. ☆127 Updated 4 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale ☆156 Updated last year
- The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the datase… ☆200 Updated 5 years ago
- New dataset ☆307 Updated 4 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models". ☆188 Updated 4 years ago
- Pre-trained, multilingual sequence-to-sequence models for Indian languages ☆50 Updated 3 years ago
- A BART version of an open-domain QA model in a closed-book setup ☆119 Updated 5 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries ☆19 Updated 3 years ago
- Fine-tune transformers with pytorch-lightning ☆44 Updated 3 years ago
- On Generating Extended Summaries of Long Documents ☆78 Updated 4 years ago
- Repository for the English-Hindi Codemixed to Monolingual English Parallel Corpus ☆13 Updated 6 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction ☆120 Updated 4 years ago
- Collection of NLP model explanations and accompanying analysis tools ☆144 Updated 2 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases". ☆99 Updated 2 years ago