jbloomAus / DecisionTransformerInterpretability
Interpreting how transformers simulate agents performing RL tasks
☆90 · Updated 2 years ago
Alternatives and similar repositories for DecisionTransformerInterpretability
Users who are interested in DecisionTransformerInterpretability are comparing it to the repositories listed below.
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks. ☆32 · Updated 2 months ago
- Intrinsic Motivation from Artificial Intelligence Feedback ☆134 · Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆121 · Updated last year
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning ☆17 · Updated 3 years ago
- Object Centric Atari games ☆98 · Updated last month
- Contains JAX implementation of algorithms for inverse reinforcement learning ☆74 · Updated last year
- General Modules for JAX ☆72 · Updated 4 months ago
- ☆37 · Updated 2 years ago
- ☆60 · Updated last year
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design. ☆138 · Updated last year
- JAX library for MARL research ☆87 · Updated 2 years ago
- ☆57 · Updated last year
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆59 · Updated 3 years ago
- Challenging Memory-based Deep Reinforcement Learning Agents ☆109 · Updated last year
- Efficient baselines for autocurricula in JAX. ☆206 · Updated last year
- Baselines for gymnax 🤖 ☆74 · Updated 2 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)] ☆112 · Updated 2 years ago
- Atari-style POMDPs ☆21 · Updated last week
- ☆91 · Updated 4 months ago
- ☆58 · Updated 3 years ago
- An Open-Ended Agentic Simulator ☆58 · Updated last year
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025). ☆72 · Updated last year
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF" ☆33 · Updated 2 years ago
- Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses". ☆87 · Updated last year
- ☆109 · Updated last year
- ☆32 · Updated 4 years ago
- ☆19 · Updated 3 years ago
- ☆35 · Updated 2 years ago
- Scalable Opponent Shaping Experiments in JAX ☆25 · Updated last year
- ☆19 · Updated 3 years ago