jbloomAus / DecisionTransformerInterpretability
Interpreting how transformers simulate agents performing RL tasks
☆80Updated last year
Alternatives and similar repositories for DecisionTransformerInterpretability
Users that are interested in DecisionTransformerInterpretability are comparing it to the libraries listed below
Sorting:
- ☆19Updated 2 years ago
- ☆77Updated last month
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆112Updated 8 months ago
- Baselines for gymnax 🤖☆66Updated 2 years ago
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)☆20Updated 8 months ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆72Updated 8 months ago
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆51Updated 2 years ago
- General Modules for JAX☆65Updated last month
- ☆34Updated 2 years ago
- Scaling scaling laws with board games.☆48Updated last year
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated 11 months ago
- Tools for studying developmental interpretability in neural networks.☆89Updated 3 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆55Updated 2 years ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆97Updated 6 months ago
- An Open-Ended Agentic Simulator☆49Updated 9 months ago
- Nethack Learning Environment Wrapper for Language Interface☆37Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆10Updated last year
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Updated 2 years ago
- Various reinforcement learning algorithms written in Jax + Flax☆24Updated last year
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks.☆28Updated 8 months ago
- Repo to reproduce the First-Explore paper results☆37Updated 4 months ago
- ☆20Updated 2 years ago
- ☆79Updated 6 months ago
- Object Centric Atari games☆76Updated last week
- Simple single-file baselines for Q-Learning in pure-GPU setting☆161Updated last month
- Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".☆82Updated last year
- Mechanistic Interpretability for Transformer Models☆50Updated 2 years ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆48Updated 10 months ago
- Accelerated minigrid environments with JAX☆135Updated 9 months ago
- Learning diverse options through the Laplacian representation.☆23Updated last year