AlignmentResearch / learned-plannerLinks
Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban
☆15Updated 3 months ago
Alternatives and similar repositories for learned-planner
Users that are interested in learned-planner are comparing it to the libraries listed below
Sorting:
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆65Updated 7 months ago
- ⚓️ Interactive playground for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper.☆17Updated 2 months ago
- Measuring the situational awareness of language models☆38Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated last year
- Simple GRPO scripts and configurations.☆59Updated 8 months ago
- ☆55Updated 11 months ago
- REBUS: A Robust Evaluation Benchmark of Understanding Symbols☆13Updated last year
- ☆24Updated 6 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆28Updated 2 years ago
- ☆56Updated 3 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆151Updated 8 months ago
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆61Updated 6 months ago
- ☆102Updated 9 months ago
- ☆30Updated 5 months ago
- Evaluation of neuro-symbolic engines☆39Updated last year
- Minimum Description Length probing for neural network representations☆20Updated 8 months ago
- [ICML 24 NGSM workshop] Associative Recurrent Memory Transformer implementation and scripts for training and evaluation☆52Updated this week
- ☆122Updated 7 months ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆35Updated 11 months ago
- ☆98Updated 2 months ago
- Can Language Models Solve Olympiad Programming?☆118Updated 9 months ago
- Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs"☆55Updated 7 months ago
- ☆28Updated 6 months ago
- ☆23Updated last year
- ☆93Updated 4 months ago
- A Gymnasium-based Environment of the Abstraction and Reasoning Corpus (ARC)☆68Updated last year
- CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)☆71Updated last year
- Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL☆59Updated last year
- A collection of lightweight interpretability scripts to understand how LLMs think☆59Updated this week
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆21Updated 2 years ago