AlignmentResearch / learned-plannerLinks
Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban
☆15Updated 2 months ago
Alternatives and similar repositories for learned-planner
Users that are interested in learned-planner are comparing it to the libraries listed below
Sorting:
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆63Updated 6 months ago
- ⚓️ Interactive playground for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper.☆17Updated 3 weeks ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆35Updated 10 months ago
- Measuring the situational awareness of language models☆38Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated 2 years ago
- ☆54Updated 9 months ago
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆21Updated 2 years ago
- REBUS: A Robust Evaluation Benchmark of Understanding Symbols☆13Updated last year
- Repository for the paper Stream of Search: Learning to Search in Language☆150Updated 7 months ago
- ☆27Updated 3 months ago
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆61Updated 4 months ago
- ☆27Updated last year
- Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL☆57Updated 10 months ago
- ☆56Updated 2 months ago
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆49Updated 10 months ago
- Simple repository for training small reasoning models☆38Updated 6 months ago
- ☆90Updated 7 months ago
- The official implementation of the paper "Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction".☆33Updated last year
- ☆54Updated last year
- ☆27Updated last year
- Minimum Description Length probing for neural network representations☆18Updated 7 months ago
- GoldFinch and other hybrid transformer components☆45Updated last year
- ☆121Updated 6 months ago
- [ICLR 2025] Code for the paper "Implicit Search via Discrete Diffusion: A Study on Chess"☆31Updated 6 months ago
- Simple GRPO scripts and configurations.☆59Updated 6 months ago
- LLM reads a paper and produce a working prototype☆57Updated 4 months ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated last year
- Functional Benchmarks and the Reasoning Gap☆88Updated 11 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆33Updated 4 months ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆23Updated last week