jbloomAus / DecisionTransformerInterpretability
Interpreting how transformers simulate agents performing RL tasks
☆90 Updated 2 years ago
Alternatives and similar repositories for DecisionTransformerInterpretability
Users who are interested in DecisionTransformerInterpretability are comparing it to the libraries listed below.
- ☆37 Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆122 Updated last year
- ☆61 Updated last year
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks. ☆34 Updated 3 months ago
- Atari-style POMDPs ☆23 Updated last month
- Intrinsic Motivation from Artificial Intelligence Feedback ☆134 Updated 2 years ago
- Efficient baselines for autocurricula in JAX. ☆206 Updated last year
- Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses". ☆87 Updated last year
- ☆91 Updated 2 weeks ago
- Object Centric Atari games ☆99 Updated 2 months ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)] ☆112 Updated 2 years ago
- ☆57 Updated last year
- Learning diverse options through the Laplacian representation. ☆23 Updated 2 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learning ☆74 Updated last year
- ☆128 Updated 2 years ago
- ☆110 Updated last year
- JAX library for MARL research ☆87 Updated 2 years ago
- Scalable Opponent Shaping Experiments in JAX ☆25 Updated last year
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design. ☆138 Updated last year
- General Modules for JAX ☆72 Updated 4 months ago
- Challenging Memory-based Deep Reinforcement Learning Agents ☆109 Updated last year
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs). ☆244 Updated last month
- Baselines for gymnax 🤖 ☆74 Updated 2 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆57 Updated last year
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making" ☆28 Updated last year
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu ☆106 Updated 3 years ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning ☆17 Updated 3 years ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning) ☆61 Updated last year
- ☆46 Updated last year
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025). ☆73 Updated last year