jbloomAus / DecisionTransformerInterpretability
Interpreting how transformers simulate agents performing RL tasks
☆89 Updated 2 years ago
Alternatives and similar repositories for DecisionTransformerInterpretability
Users who are interested in DecisionTransformerInterpretability are comparing it to the libraries listed below.
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks. ☆32 Updated 2 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆120 Updated last year
- Object Centric Atari games ☆96 Updated 3 weeks ago
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025). ☆72 Updated last year
- ☆57 Updated last year
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design. ☆138 Updated last year
- ☆128 Updated last year
- Efficient baselines for autocurricula in JAX. ☆205 Updated last year
- Intrinsic Motivation from Artificial Intelligence Feedback ☆134 Updated 2 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)] ☆111 Updated 2 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu ☆106 Updated 3 years ago
- Baselines for gymnax 🤖 ☆74 Updated 2 years ago
- ☆60 Updated last year
- ☆37 Updated 2 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆57 Updated last year
- JAX library for MARL research ☆88 Updated 2 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learning ☆74 Updated last year
- Scalable Opponent Shaping Experiments in JAX ☆24 Updated last year
- ☆58 Updated 3 years ago
- Scaling scaling laws with board games. ☆54 Updated 2 years ago
- General Modules for JAX ☆72 Updated 3 months ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e… ☆85 Updated 2 years ago
- ☆52 Updated 2 years ago
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions ☆72 Updated last year
- ☆90 Updated 3 months ago
- PyTorch Package For Quasimetric Learning ☆44 Updated last year
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according … ☆35 Updated last year
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).