Aleph-Alpha-Research / eval-frameworkLinks
☆19Updated this week
Alternatives and similar repositories for eval-framework
Users that are interested in eval-framework are comparing it to the libraries listed below
Sorting:
- Code for the paper "Implicit Representations of Meaning in Neural Language Models"☆54Updated 2 years ago
- ☆30Updated 3 years ago
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).☆20Updated 3 years ago
- Exploring Few-Shot Adaptation of Language Models with Tables☆24Updated 3 years ago
- Benchmark API for Multidomain Language Modeling☆25Updated 3 years ago
- ☆24Updated last year
- OpenPI dataset for tracking entities in open domain procedural text☆24Updated last year
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆44Updated last year
- ☆49Updated 2 years ago
- ☆84Updated last year
- ReaSCAN is a synthetic navigation task that requires models to reason about surroundings over syntactically difficult languages. (NeurIPS…☆19Updated 3 years ago
- EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling☆34Updated 3 years ago
- Repository for the Question Answering via Sentence Composition (QASC) dataset☆56Updated 2 years ago
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 3 years ago
- PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World [ACL 2021]☆57Updated 3 years ago
- Automatic metrics for GEM tasks☆67Updated 2 years ago
- opentqa is a open framework of the textbook question answering, which includes xtqa, mcan, cmr, mfb, mutan.☆11Updated 4 years ago
- Evaluating Machines by their Real-World Language Use☆33Updated 2 years ago
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021☆29Updated 2 years ago
- [NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers☆21Updated 2 years ago
- Code, data, models for the Sherlock corpus☆58Updated 2 years ago
- DEMix Layers for Modular Language Modeling☆54Updated 4 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆59Updated 2 years ago
- [EMNLP 2020] Collective HumAn OpinionS on Natural Language Inference Data☆40Updated 3 years ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16Updated 3 years ago
- The InterScript dataset contains interactive user feedback on scripts generated by a T5-XXL model.☆12Updated 3 years ago
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings☆16Updated 2 years ago
- A highly sophisticated sequence-to-sequence model for code generation☆40Updated 4 years ago
- Differentiable First-Order Logic Reasoning for Visual Question Answering☆41Updated 4 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆85Updated 3 years ago