Framework for testing models with AI2 leaderboards
☆21Nov 8, 2023Updated 2 years ago
Alternatives and similar repositories for ai2
Users that are interested in ai2 are comparing it to the libraries listed below
Sorting:
- Code repo for EMNLP 2019 WIQA dataset paper☆13Jun 12, 2023Updated 2 years ago
- Leaderboard implementations for datasets produced by the Mosaic Team.☆20Jul 6, 2023Updated 2 years ago
- ☆34Oct 30, 2020Updated 5 years ago
- Tools and datasets for Aristo Leaderboards☆42May 17, 2021Updated 4 years ago
- A tool for extracting plain text from Wikipedia dumps☆15Sep 13, 2018Updated 7 years ago
- ☆11Oct 3, 2021Updated 4 years ago
- Dataset & Code for Com2Sense Benchmark☆13Sep 8, 2021Updated 4 years ago
- ☆38Jul 20, 2020Updated 5 years ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆23Apr 30, 2025Updated 10 months ago
- Repository for the What's Missing EMNLP'19 paper☆17Mar 12, 2021Updated 4 years ago
- Commonsense Explanations for Commonsense Question Answering☆13Jun 27, 2019Updated 6 years ago
- Author implementation of the paper "CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge"☆168Jul 25, 2024Updated last year
- PyTorch implementation of L2R2 in SIGIR 2020☆17Jun 12, 2023Updated 2 years ago
- Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"☆23Oct 26, 2021Updated 4 years ago
- Finding Generalizable Evidence by Learning to Convince Q&A Models☆25Jan 5, 2023Updated 3 years ago
- "Learning What is Essential in Questions", CoNLL, 2017☆26Aug 3, 2018Updated 7 years ago
- Code for ModularQA☆28Jun 8, 2021Updated 4 years ago
- A paper list of research conducted based on wikiHow☆27Mar 5, 2022Updated 4 years ago
- Commonsense Ability Tests☆29Mar 8, 2022Updated 4 years ago
- A BART version of an open-domain QA model in a closed-book setup☆119Aug 13, 2020Updated 5 years ago
- Code Repo for "Differentiable Open-Ended Commonsense Reasoning" (NAACL 2021)☆32Jun 30, 2023Updated 2 years ago
- An original implementation of EMNLP 2019, "A Discrete Hard EM Approach for Weakly Supervised Question Answering"☆135Jul 3, 2020Updated 5 years ago
- ☆75Apr 4, 2024Updated last year
- Official implementation of the EMNLP 2021 paper "ONION: A Simple and Effective Defense Against Textual Backdoor Attacks"☆36Nov 3, 2021Updated 4 years ago
- ACL'23: Unified Demonstration Retriever for In-Context Learning☆38Dec 2, 2023Updated 2 years ago
- ☆10Oct 2, 2024Updated last year
- ☆10Oct 20, 2020Updated 5 years ago
- Source code accompanying the NeurIPS 2022 paper "Learning Partial Equivariances From Data"☆10Nov 18, 2022Updated 3 years ago
- FaVIQ: Fact Verification from Information-seeking Questions☆43Nov 23, 2022Updated 3 years ago
- Improving Machine Reading Comprehension with General Reading Strategies☆37Apr 23, 2019Updated 6 years ago
- This repository provides the dataset used in "Schema-Guided Natural Language Generation" by Yuheng Du, Shereen Oraby, Vittorio Perera, Mi…☆13Dec 8, 2020Updated 5 years ago
- Natural Perturbation for Robust Question Answering☆12Apr 7, 2020Updated 5 years ago
- Nadir: Cutting-edge PyTorch optimizers for simplicity & composability! 🔥🚀💻☆14Jun 15, 2024Updated last year
- ☆11Apr 4, 2018Updated 7 years ago
- Natural language dataset for training a Conversational Recommender System☆11Jul 9, 2019Updated 6 years ago
- This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!☆15Nov 22, 2023Updated 2 years ago
- This is the paddle code for SeBoW(Self-Born wiring for neural trees), a kind of neural tree born form a large search space☆11Dec 10, 2021Updated 4 years ago
- This repository contains the dataset and implementation details of the paper "An In-depth Analysis of Implicit and Subtle Hate Speech Mes…☆10May 9, 2024Updated last year
- The official evaluation suite and dynamic data release for MixEval.☆11Sep 23, 2024Updated last year