AlignmentResearch / learned-planner
Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban
☆15Updated 6 months ago
Alternatives and similar repositories for learned-planner
Users who are interested in learned-planner are comparing it to the libraries listed below.
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆65Updated 10 months ago
- Measuring the situational awareness of language models☆39Updated last year
- Repository for the paper Stream of Search: Learning to Search in Language☆152Updated 11 months ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆36Updated last year
- ☆34Updated 7 months ago
- ☆124Updated 10 months ago
- ☆105Updated last year
- Intrinsic Motivation from Artificial Intelligence Feedback☆134Updated 2 years ago
- Materials for ConceptARC paper☆110Updated last year
- ☆113Updated 3 months ago
- Automated Capability Discovery via Foundation Model Self-Exploration☆66Updated 10 months ago
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆63Updated 9 months ago
- OMNI: Open-endedness via Models of human Notions of Interestingness☆57Updated 11 months ago
- A Gymnasium-based Environment of the Abstraction and Reasoning Corpus (ARC)☆69Updated last year
- ☆164Updated 4 months ago
- Simple GRPO scripts and configurations.☆59Updated 11 months ago
- ☆55Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- REBUS: A Robust Evaluation Benchmark of Understanding Symbols☆13Updated last year
- Code for☆28Updated last year
- A Python library for automatically solving Abstraction and Reasoning Corpus (ARC) challenges using Claude and object-centric modeling.☆25Updated last year
- Can Language Models Solve Olympiad Programming?☆124Updated 11 months ago
- [ICML 24 NGSM workshop] Associative Recurrent Memory Transformer implementation and scripts for training and evaluation☆60Updated this week
- A virtual environment for developing and evaluating automated scientific discovery agents.☆198Updated 10 months ago
- Clean RL implementation using MLX☆34Updated last year
- ☆63Updated 6 months ago
- ☆30Updated 6 months ago
- ☆91Updated last year
- An open-source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆29Updated last month
- Learning Universal Predictors☆81Updated last year