Interpretable Evaluation for AI Systems
☆367Mar 10, 2023Updated 3 years ago
Alternatives and similar repositories for ExplainaBoard
Users that are interested in ExplainaBoard are comparing it to the libraries listed below.
- The unified platform for data-related resources.☆134Mar 6, 2023Updated 3 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks☆194Sep 22, 2025Updated 7 months ago
- BARTScore: Evaluating Generated Text as Text Generation☆369Jun 27, 2022Updated 3 years ago
- ☆401Oct 12, 2021Updated 4 years ago
- Beyond Accuracy: Behavioral Testing of NLP models with CheckList☆2,050Jan 9, 2024Updated 2 years ago
- PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an…☆286Oct 20, 2022Updated 3 years ago
- EMNLP 2021 - CTC: A Unified Framework for Evaluating Natural Language Generation☆97Mar 20, 2023Updated 3 years ago
- Library for Knowledge Intensive Language Tasks☆971Mar 31, 2022Updated 4 years ago
- REALSumm: Re-evaluating Evaluation in Text Summarization☆73Sep 22, 2025Updated 7 months ago
- Toolkit for creating, sharing and using natural language prompts.☆3,008Oct 23, 2023Updated 2 years ago
- Must-read papers on prompt-based tuning for pre-trained language models.☆4,301Jul 17, 2023Updated 2 years ago
- Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing☆651Sep 27, 2022Updated 3 years ago
- [ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723☆730Aug 29, 2022Updated 3 years ago
- This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering".☆80Jun 3, 2021Updated 4 years ago
- Code for paper "CrossFit : A Few-shot Learning Challenge for Cross-task Generalization in NLP" (https://arxiv.org/abs/2104.08835)☆113Apr 28, 2022Updated 4 years ago
- BERT score for text generation☆1,890Jul 30, 2024Updated last year
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations☆787May 19, 2024Updated last year
- [NAACL 2021] Factual Probing Is [MASK]: Learning vs. Learning to Recall https://arxiv.org/abs/2104.05240☆168Oct 7, 2022Updated 3 years ago
- An original implementation of "Noisy Channel Language Model Prompting for Few-Shot Text Classification"☆131Apr 23, 2022Updated 4 years ago
- This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 4…☆277Mar 26, 2024Updated 2 years ago
- A Diagnostic Study of Explainability Techniques for Text Classification☆70Oct 23, 2020Updated 5 years ago
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"☆1,627Jun 12, 2023Updated 2 years ago
- ☆15Oct 30, 2021Updated 4 years ago
- Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper☆415Jun 23, 2024Updated last year
- ☆22Feb 26, 2024Updated 2 years ago
- [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining☆118Jul 25, 2023Updated 2 years ago
- The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic …☆3,653Apr 15, 2026Updated 2 weeks ago
- Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization"☆68Jul 4, 2021Updated 4 years ago
- ACL2020 Tutorial: Open-Domain Question Answering☆835Jan 1, 2021Updated 5 years ago
- SpanNER: Named Entity Re-/Recognition as Span Prediction☆133May 13, 2022Updated 3 years ago
- Shared repository for open-sourced projects from the Google AI Language team.☆1,774Apr 21, 2026Updated last week
- skweak: A software toolkit for weak supervision applied to NLP tasks☆926Sep 2, 2024Updated last year
- A Unified Library for Parameter-Efficient and Modular Transfer Learning☆2,811Mar 21, 2026Updated last month
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…☆435Aug 17, 2022Updated 3 years ago
- Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models☆3,234Jul 19, 2024Updated last year
- Autoregressive Entity Retrieval☆798Jul 6, 2023Updated 2 years ago
- Code for gradient rollback, which explains predictions of neural matrix factorization models, as for example used for knowledge base comp…☆21Mar 16, 2021Updated 5 years ago
- DEMix Layers for Modular Language Modeling☆54Feb 25, 2026Updated 2 months ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆151Oct 22, 2022Updated 3 years ago