neulab / ExplainaBoard

Interpretable Evaluation for AI Systems

☆366

Alternatives and similar repositories for ExplainaBoard

Users who are interested in ExplainaBoard are comparing it to the libraries listed below.

allenai / acl2022-zerofewshot-tutorial
☆292 Updated 2 years ago
princeton-nlp / DensePhrases
[ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…
☆604 Updated 3 years ago
alexa / dialoglue
DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue
☆283 Updated 2 years ago
facebookresearch / anli
Adversarial Natural Language Inference Benchmark
☆397 Updated 3 years ago
allenai / naacl2021-longdoc-tutorial
☆345 Updated 4 years ago
yaoxingcheng / TLM
ICML'2022: NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework
☆255 Updated last year
urvashik / knnlm
☆321 Updated 4 years ago
timoschick / dino
This repository contains the code for "Generating Datasets with Pretrained Language Models".
☆188 Updated 3 years ago
neulab / BARTScore
BARTScore: Evaluating Generated Text as Text Generation
☆356 Updated 3 years ago
JohnGiorgi / DeCLUTR
The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to o…
☆380 Updated 2 years ago
Eric-Wallace / interpretability-tutorial-emnlp2020
Materials for the EMNLP 2020 Tutorial on "Interpreting Predictions of NLP Models"
☆199 Updated 4 years ago
bplank / awesome-neural-adaptation-in-NLP
Awesome Neural Adaptation in Natural Language Processing. A curated list. https://arxiv.org/abs/2006.00632
☆266 Updated 4 years ago
AIPHES / emnlp19-moverscore
MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance
☆208 Updated last year
salesforce / GeDi
GeDi: Generative Discriminator Guided Sequence Generation
☆211 Updated last month
neulab / InterpretEval
Interpretable Evaluation for (Almost) All NLP Tasks
☆195 Updated 2 years ago
allenai / unifiedqa
UnifiedQA: Crossing Format Boundaries With a Single QA System
☆441 Updated 3 years ago
jayded / eraserbenchmark
A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/
☆97 Updated 2 years ago
facebookresearch / PAQ
Code and data to support the paper "PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them"
☆203 Updated 3 years ago
voidism / DiffCSE
Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"
☆294 Updated 2 years ago
allenai / dont-stop-pretraining
Code associated with the Don't Stop Pretraining ACL 2020 paper
☆532 Updated 3 years ago
IntelLabs / academic-budget-bert
Repository containing code for "How to Train BERT with an Academic Budget" paper
☆314 Updated last year
rrmenon10 / ADAPET
[EMNLP 2021] Improving and Simplifying Pattern Exploiting Training
☆153 Updated 3 years ago
google-research-datasets / tydiqa
TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …
☆310 Updated 5 years ago
zcgzcgzcg1 / ACL2022_KnowledgeNLP_Tutorial
Materials for ACL-2022 tutorial: Knowledge-Augmented Methods for Natural Language Processing
☆288 Updated 2 years ago
Yale-LILY / dart
Dataset for NAACL 2021 paper: "DART: Open-Domain Structured Data Record to Text Generation"
☆155 Updated 2 years ago
google-deepmind / xquad
☆199 Updated 3 years ago
facebookresearch / MLQA
New dataset
☆306 Updated 3 years ago
luyug / Condenser
EMNLP 2021 - Pre-training architectures for dense retrieval
☆251 Updated 3 years ago
yg211 / bert_nli
A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT)
☆132 Updated last year
castorini / pygaggle
A gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini
☆350 Updated last year