jbloomAus / DecisionTransformerInterpretability
Interpreting how transformers simulate agents performing RL tasks
☆82 · Updated last year
Alternatives and similar repositories for DecisionTransformerInterpretability
Users who are interested in DecisionTransformerInterpretability are comparing it to the libraries listed below.
- Object Centric Atari games ☆78 · Updated this week
- MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research ☆21 · Updated 2 months ago
- ☆79 · Updated 2 months ago
- ☆34 · Updated 2 years ago
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks. ☆29 · Updated 9 months ago
- General Modules for JAX ☆66 · Updated last month
- Baselines for gymnax 🤖 ☆66 · Updated 2 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according … ☆35 · Updated last year
- ☆20 · Updated 2 years ago
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult" ☆22 · Updated last month
- ☆19 · Updated 2 years ago
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making" ☆27 · Updated 10 months ago
- Redwood Research's transformer interpretability tools ☆15 · Updated 3 years ago
- Repo to reproduce the First-Explore paper results ☆37 · Updated 5 months ago
- ☆128 · Updated last year
- ☆44 · Updated 8 months ago
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus) ☆20 · Updated 9 months ago
- Mechanistic Interpretability for Transformer Models ☆51 · Updated 3 years ago
- A reinforcement learning environment for the IGLU 2022 at NeurIPS ☆33 · Updated 2 years ago
- Challenging Memory-based Deep Reinforcement Learning Agents ☆99 · Updated 7 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆112 · Updated 9 months ago
- An Open-Ended Agentic Simulator ☆51 · Updated 9 months ago
- ☆53 · Updated 7 months ago
- Evaluating long-term memory of reinforcement learning algorithms ☆142 · Updated last year
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu ☆105 · Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆56 · Updated 2 years ago
- Scalable Opponent Shaping Experiments in JAX ☆24 · Updated last year
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆57 · Updated last year
- Benchmark environments for reward modelling and imitation learning algorithms. ☆46 · Updated last year
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning" ☆207 · Updated last year