Yale-LILY / FOLIO
☆117Updated last year
Alternatives and similar repositories for FOLIO:
Users that are interested in FOLIO are comparing it to the libraries listed below
- EMNLP 2022: Generating Natural Language Proofs with Verifier-Guided Search https://arxiv.org/abs/2205.12443☆84Updated 6 months ago
- ☆46Updated 3 months ago
- ☆82Updated 2 years ago
- A unified benchmark for math reasoning☆87Updated 2 years ago
- Code to reproduce experiments in the paper "Constrained Language Models Yield Few-Shot Semantic Parsers" (EMNLP 2021).☆62Updated 9 months ago
- PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)☆71Updated 2 years ago
- ☆44Updated last year
- Companion repo for "Evaluating Verifiability in Generative Search Engines".☆83Updated last year
- ☆137Updated 2 years ago
- LogiTorch is a PyTorch-based library for logical reasoning on natural language☆70Updated 6 months ago
- Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.☆138Updated 5 months ago
- The LM Contamination Index is a manually created database of contamination evidences for LMs.☆77Updated 11 months ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆41Updated last year
- ☆48Updated 2 years ago
- NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks☆20Updated 2 years ago
- The code of Paper "Logic-Driven Context Extension and Data Augmentation for Logical Reasoning of Text".☆44Updated 2 years ago
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning☆101Updated 4 years ago
- 🔗 LINC: Logical Inference via Neurosymbolic Computation [EMNLP2023]☆63Updated last year
- WikiWhy is a new benchmark for evaluating LLMs' ability to explain between cause-effect relationships. It is a QA dataset containing 9000…☆47Updated last year
- Author implementation of the paper "CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge"☆158Updated 8 months ago
- EMNLP 2022: Finding Dataset Shortcuts with Grammar Induction https://arxiv.org/abs/2210.11560☆58Updated 3 weeks ago
- ☆58Updated 2 years ago
- ☆75Updated last year
- The autoregressive information extraction system GenIE (Generative Information Extraction) implemented in PyTorch.☆100Updated last year
- Code for generating the JuICe dataset.☆36Updated 3 years ago
- First explanation metric (diagnostic report) for text generation evaluation☆62Updated 3 weeks ago
- ☆71Updated 11 months ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆85Updated 2 years ago
- ☆83Updated 2 years ago
- Grammar Prompting for Domain-Specific Language Generation with Large Language Models☆65Updated last year