pkargupta / cognitive_foundationsLinks
A framework bridging cognitive science and LLM reasoning research to diagnose and improve how large language models reason, based on analysis of 192K model traces and 54 human think-aloud traces.
☆31Updated 2 months ago
Alternatives and similar repositories for cognitive_foundations
Users that are interested in cognitive_foundations are comparing it to the libraries listed below
Sorting:
- WONDERBREAD benchmark + dataset for BPM tasks☆34Updated 6 months ago
- ☆227Updated 11 months ago
- Process Reward Models That Think☆77Updated 2 months ago
- ☆130Updated this week
- ☆45Updated 7 months ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆50Updated last year
- ☆24Updated 9 months ago
- official implementation of paper "Process Reward Model with Q-value Rankings"☆65Updated 11 months ago
- ☆35Updated 8 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆148Updated last year
- ☆50Updated 11 months ago
- [ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search☆108Updated 7 months ago
- This repository is maintained to release dataset and models for multimodal puzzle reasoning.☆113Updated 11 months ago
- ☆37Updated 8 months ago
- [ICLR 2026] Official PyTorch Implementation of RLP: Reinforcement as a Pretraining Objective☆226Updated this week
- ☆33Updated last year
- Solving Inequality Proofs with Large Language Models.☆56Updated last month
- Official repo of paper LM2☆46Updated 11 months ago
- SSRL: Self-Search Reinforcement Learning☆205Updated 5 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Updated 8 months ago
- Verifiers for LLM Reinforcement Learning☆80Updated 9 months ago
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆51Updated last year
- Inverse Scaling in Test-Time Compute☆24Updated last month
- ☆72Updated 7 months ago
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models☆71Updated 11 months ago
- A testbed for agents and environments that can automatically improve models through data generation.☆28Updated 10 months ago
- Official Repo for InSTA: Towards Internet-Scale Training For Agents☆55Updated 6 months ago
- ☆108Updated last month
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆123Updated 5 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆35Updated last year