AlignmentResearch / learned-planner
Interpretability tools for recurrent convolutional networks (DRC) that play Sokoban
☆12Updated 2 months ago
Alternatives and similar repositories for learned-planner
Users that are interested in learned-planner are comparing it to the libraries listed below
Sorting:
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- Code repo for MathAgent☆16Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆32Updated last month
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆33Updated 7 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆25Updated 5 months ago
- ☆21Updated last year
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆12Updated this week
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆16Updated last year
- ☆48Updated 6 months ago
- Collection of LLM completions for reasoning-gym task datasets☆20Updated last week
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 8 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated 11 months ago
- Minimum Description Length probing for neural network representations☆19Updated 3 months ago
- ☆23Updated last month
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆26Updated 10 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated last week
- ☆29Updated last year
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated 6 months ago
- Measuring the situational awareness of language models☆34Updated last year
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆33Updated 6 months ago
- A quick way to get started with Transformer Lens☆14Updated last year
- Elevate your language models with insightful diversity metrics.☆11Updated last year
- Simple repository for training small reasoning models☆27Updated 3 months ago
- ☆50Updated 5 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆23Updated last month
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆64Updated last year
- A Data Source for Reasoning Embodied Agents☆19Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆43Updated last year
- ☆14Updated last year
- Python package for generating datasets to evaluate reasoning and retrieval of large language models☆18Updated this week