jbloomAus / DecisionTransformerInterpretability
Interpreting how transformers simulate agents performing RL tasks
☆78 Updated last year
Alternatives and similar repositories for DecisionTransformerInterpretability:
Users who are interested in DecisionTransformerInterpretability are comparing it to the libraries listed below:
- ☆74 Updated 7 months ago
- ☆19 Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆109 Updated 6 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆142 Updated this week
- Unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets" ☆76 Updated 2 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learning ☆71 Updated 7 months ago
- General Modules for JAX ☆64 Updated 3 weeks ago
- ☆81 Updated 8 months ago
- PAIRED in PyTorch 🔥 ☆58 Updated 2 years ago
- ☆34 Updated 2 years ago
- Novelty MiniGrid (NovGrid) is an extension of the MiniGrid environment that allows the world properties and dynamics to change according … ☆35 Updated 10 months ago
- Baselines for gymnax 🤖 ☆66 Updated last year
- ☆47 Updated 10 months ago
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning" ☆205 Updated last year
- An Open-Ended Agentic Simulator ☆45 Updated 7 months ago
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult" ☆22 Updated 3 months ago
- Evaluating long-term memory of reinforcement learning algorithms ☆141 Updated last year
- A simple and scalable agent for training adaptive policies with sequence-based RL ☆114 Updated last month
- ☆35 Updated last year
- ☆52 Updated 10 months ago
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks. ☆27 Updated 6 months ago
- Challenging Memory-based Deep Reinforcement Learning Agents ☆93 Updated 4 months ago
- JAX implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆53 Updated 2 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆53 Updated 11 months ago
- Object Centric Atari games ☆70 Updated this week
- Synchronized Curriculum Learning for RL Agents ☆41 Updated this week
- Scalable Opponent Shaping Experiments in JAX ☆24 Updated 11 months ago
- ☆73 Updated 4 months ago
- ☆28 Updated 2 years ago
- ☆20 Updated 2 years ago