facebookresearch / anli
Adversarial Natural Language Inference Benchmark
☆393Updated 2 years ago
Alternatives and similar repositories for anli:
Users that are interested in anli are comparing it to the libraries listed below
- Interpretable Evaluation for AI Systems☆363Updated 2 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆301Updated 4 years ago
- UnifiedQA: Crossing Format Boundaries With a Single QA System☆432Updated 2 years ago
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"☆202Updated 3 years ago
- ☆190Updated 3 years ago
- ☆318Updated 3 years ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…☆640Updated 2 years ago
- Repository containing code for "How to Train BERT with an Academic Budget" paper☆312Updated last year
- GeDi: Generative Discriminator Guided Sequence Generation☆208Updated 2 years ago
- Scripts and links to recreate the ELI5 dataset.☆324Updated 3 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 3 years ago
- New dataset☆303Updated 3 years ago
- Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper☆388Updated 9 months ago
- BLEURT is a metric for Natural Language Generation based on transfer learning.☆719Updated last year
- ☆159Updated 5 years ago
- Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper☆292Updated last year
- PyTorch original implementation of "Unsupervised Question Decomposition for Question Answering"☆120Updated last year
- Few-shot Learning of GPT-3☆346Updated last year
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆362Updated 3 years ago
- DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue☆282Updated last year
- Library for Knowledge Intensive Language Tasks☆935Updated 2 years ago
- Officially supported AllenNLP models☆538Updated 2 years ago
- Heuristic Analysis for NLI Systems☆125Updated 4 years ago
- BARTScore: Evaluating Generated Text as Text Generation☆344Updated 2 years ago
- PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an…☆270Updated 2 years ago
- A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/☆96Updated 2 years ago
- An original implementation of EMNLP 2020, "AmbigQA: Answering Ambiguous Open-domain Questions"☆118Updated 2 years ago
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)☆327Updated last year
- ☆293Updated 2 years ago
- A list of publications on NLP interpretability (Welcome PR)☆168Updated 4 years ago