neulab / ExplainaBoard
Interpretable Evaluation for AI Systems
☆363Updated 2 years ago
Alternatives and similar repositories for ExplainaBoard:
Users that are interested in ExplainaBoard are comparing it to the libraries listed below
- ☆292Updated 2 years ago
- Officially supported AllenNLP models☆540Updated 2 years ago
- ☆345Updated 3 years ago
- [ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723☆728Updated 2 years ago
- ☆318Updated 3 years ago
- ☆397Updated 3 years ago
- Adversarial Natural Language Inference Benchmark☆393Updated 2 years ago
- Repository containing code for "How to Train BERT with an Academic Budget" paper☆312Updated last year
- DGMs for NLP. A roadmap.☆389Updated 2 years ago
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆205Updated last year
- Materials for ACL-2022 tutorial: Knowledge-Augmented Methods for Natural Language Processing☆287Updated 2 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆362Updated 3 years ago
- BARTScore: Evaluating Generated Text as Text Generation☆345Updated 2 years ago
- [NAACL 2021] Factual Probing Is [MASK]: Learning vs. Learning to Recall https://arxiv.org/abs/2104.05240☆167Updated 2 years ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…☆603Updated 2 years ago
- ICML'2022: NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework☆259Updated last year
- ☆229Updated 4 years ago
- Code associated with the Don't Stop Pretraining ACL 2020 paper☆529Updated 3 years ago
- Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper☆292Updated last year
- Dataset for NAACL 2021 paper: "DART: Open-Domain Structured Data Record to Text Generation"☆152Updated 2 years ago
- A list of publications on NLP interpretability (Welcome PR)☆168Updated 4 years ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…☆642Updated 2 years ago
- EMNLP 2021 - Pre-training architectures for dense retrieval☆247Updated 3 years ago
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)☆327Updated last year
- Materials for the EMNLP 2020 Tutorial on "Interpreting Predictions of NLP Models"☆199Updated 4 years ago
- DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue☆282Updated last year
- Interpretable Evaluation for (Almost) All NLP Tasks☆195Updated 2 years ago
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations☆781Updated 10 months ago
- MPNet: Masked and Permuted Pre-training for Language Understanding https://arxiv.org/pdf/2004.09297.pdf☆291Updated 3 years ago
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"☆202Updated 3 years ago