Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.
☆166Sep 9, 2025Updated 9 months ago
Alternatives and similar repositories for prontoqa
Users that are interested in prontoqa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs☆41Feb 15, 2024Updated 2 years ago
- Code Repo for "Differentiable Open-Ended Commonsense Reasoning" (NAACL 2021)☆32Jun 30, 2023Updated 3 years ago
- Official code repository for the main conference paper in EMNLP 2022: SubeventWriter: Iterative Sub-event Sequence Generation with Cohere…☆11Oct 16, 2022Updated 3 years ago
- Dataset & Code for Com2Sense Benchmark☆13Sep 8, 2021Updated 4 years ago
- Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula…☆11Oct 18, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This repository contains a collection of papers and resources on Reasoning in Large Language Models.☆572Nov 13, 2023Updated 2 years ago
- Source code for the paper 'Complex Hyperbolic Knowledge Graph Embeddings with Fast Fourier Transform'.☆12Nov 9, 2022Updated 3 years ago
- Implementation of generative semantic grammar.☆17Jun 2, 2022Updated 4 years ago
- Natural language understanding by probabilistic abduction of a symbolic theory from sentences and logical forms.☆18Jun 13, 2025Updated last year
- The project page for "LOGIC-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning"☆400Jun 13, 2024Updated 2 years ago
- Train large COMET (T5-3B/GPT2-XL) with small memory (on 11GB memory GPUs like 1080/2080) using DeepSpeed.☆14Jan 23, 2022Updated 4 years ago
- Bridging the Generalization Gap in Text-to-SQL Parsing with Schema Expansion☆13Jul 26, 2023Updated 2 years ago
- ☆12Apr 25, 2022Updated 4 years ago
- Codes for the EMNLP2021 paper: Benchmarking Commonsense Knowledge Base Population (https://aclanthology.org/2021.emnlp-main.705.pdf). An …☆26Feb 14, 2024Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆19Feb 25, 2022Updated 4 years ago
- CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation☆14Aug 19, 2025Updated 10 months ago
- Official Code for EMNLP2023 Main Conference paper: "KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detec…☆30Nov 14, 2023Updated 2 years ago
- ☆37Dec 20, 2024Updated last year
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Mar 16, 2023Updated 3 years ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆68Feb 13, 2023Updated 3 years ago
- The official implementation for ACL 2021 "Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval".☆28Jun 19, 2021Updated 5 years ago
- [EMNLP 2023] Once Upon a *Time* in *Graph*: Relative-Time Pretraining for Complex Temporal Reasoning☆17Oct 31, 2023Updated 2 years ago
- ScienceMeter: Tracking Scientific Knowledge Updates in Language Models☆17Jun 28, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Repository for the code associated with the paper: Unsupervised Extractive Summarization using Mutual Information☆25Sep 11, 2021Updated 4 years ago
- Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them☆565Jun 25, 2024Updated 2 years ago
- Code and data for TACL paper It’s not Rocket Science: Interpreting Figurative Language in Narratives☆15Sep 4, 2023Updated 2 years ago
- ☆23Sep 2, 2024Updated last year
- A collection of research papers related to Natural Language Reasoning☆10May 27, 2022Updated 4 years ago
- A fast and neat API for Conceptualization of Probase☆17Oct 28, 2019Updated 6 years ago
- Official code repository for the main conference paper in ACL2023: COLA: Contextualized Commonsense Causality Reasoning from the Causal I…☆34May 12, 2023Updated 3 years ago
- ☆52Oct 24, 2023Updated 2 years ago
- An extensible benchmark for evaluating large language models on planning☆467Jun 2, 2026Updated last month
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆17Apr 7, 2025Updated last year
- Code for EMNLP 2020 paper: Analogous Process Structure Induction for Sub-event Sequence Prediction☆11Oct 19, 2020Updated 5 years ago
- ☆37Mar 26, 2024Updated 2 years ago
- Official code repository for the paper: AbsPyramid: Benchmarking the Abstration Ability of Language Models with a Unified Entailment Grap…☆13Oct 30, 2024Updated last year
- [NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evalua…☆37Jun 8, 2023Updated 3 years ago
- https://openreview.net/forum?id=OC1o4_OI6Jw☆13May 27, 2022Updated 4 years ago
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest…☆31Jul 9, 2020Updated 5 years ago