AlignmentResearch / learned-plannerLinks
Interpretability tools for recurrent convolutional networks (DRC) that play Sokoban
☆13Updated last week
Alternatives and similar repositories for learned-planner
Users that are interested in learned-planner are comparing it to the libraries listed below
Sorting:
- Minimum Description Length probing for neural network representations☆18Updated 5 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆58Updated 4 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated last year
- Code repo for MathAgent☆16Updated last year
- A quick way to get started with Transformer Lens☆14Updated last year
- ☆21Updated last year
- Simple repository for training small reasoning models☆33Updated 4 months ago
- ☆22Updated 8 months ago
- Open-source Human Feedback Library☆11Updated last year
- Measuring the situational awareness of language models☆35Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- ☆51Updated 7 months ago
- Residual Quantization Autoencoder, used for interpreting LLMs☆12Updated 5 months ago
- ☆23Updated 2 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆32Updated 2 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated 7 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆33Updated 8 months ago
- ☆50Updated last month
- ☆42Updated 9 months ago
- Simple GRPO scripts and configurations.☆59Updated 4 months ago
- Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders"☆22Updated 4 months ago
- ☆11Updated 11 months ago
- ☆21Updated last month
- ☆32Updated 5 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆26Updated 6 months ago
- ☆24Updated 9 months ago
- ☆61Updated 3 weeks ago
- NeurIPS 2024 tutorial on LLM Inference☆45Updated 6 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- Latent Large Language Models☆18Updated 10 months ago