salesforce / bite
Code for "Mind Your Inflections! Improving NLP for Non-Standard Englishes with Base-Inflection Encoding" (EMNLP 2020).
☆11 Updated 2 weeks ago
Alternatives and similar repositories for bite
Users who are interested in bite are comparing it to the repositories listed below.
- Diagnostic tests for linguistic capacities in language models ☆66 Updated 3 years ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h… ☆82 Updated 4 years ago
- MT Evaluation in Many Languages via Zero-Shot Paraphrasing ☆101 Updated 9 months ago
- Analyzing mBERT's multilinguality in a small laboratory setting ☆13 Updated last year
- End-to-end shallow discourse parser ☆20 Updated last year
- The Benchmark of Linguistic Minimal Pairs ☆150 Updated 2 years ago
- Code and data for: Low Resource Grammatical Error Correction Using Wikipedia Edits (WNUT 2018) ☆16 Updated 10 months ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations ☆55 Updated 2 years ago
- ☆29 Updated 2 years ago
- Python source code for EMNLP 2020 paper "Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT". ☆35 Updated 3 years ago
- ☆39 Updated 3 years ago
- ☆28 Updated 11 months ago
- Codebase, data and models for the Keep it Simple paper at ACL2021 ☆39 Updated last year
- ☆15 Updated 3 years ago
- This repository houses the IMPlicature and PRESupposition diagnostic dataset (IMPPRES), consisting of >25k semiautomatically generated se… ☆19 Updated 3 years ago
- Code to support the paper "Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets" ☆66 Updated 3 years ago
- Statistics on multilingual datasets ☆17 Updated 2 years ago
- Automatically harvested multilingual contrastive word sense disambiguation test sets for machine translation ☆17 Updated 4 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization. ☆143 Updated 2 years ago
- A coreference evaluation package for the CoNLL and ARRAU datasets ☆40 Updated 4 years ago
- Code and Data for Evaluation WG ☆41 Updated 3 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/ ☆87 Updated 2 weeks ago
- A program to choose transfer languages for cross-lingual learning ☆72 Updated last year
- ☆36 Updated 3 years ago
- ☆52 Updated 3 years ago
- Code and data for "Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words" ☆16 Updated 3 years ago
- Codebase for probing and visualizing multilingual models. ☆48 Updated 5 years ago
- Neural CRF Model for Sentence Alignment in Text Simplification ☆67 Updated 3 months ago
- Code and data accompanying our ACL 2020 paper, "Unsupervised Domain Clusters in Pretrained Language Models". ☆58 Updated 4 years ago
- This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization. ☆11 Updated 5 years ago