google-deepmind / agent_debugger
Causal Analysis of Agent Behavior for AI Safety
☆19 Updated 2 years ago
Alternatives and similar repositories for agent_debugger
Users who are interested in agent_debugger are comparing it to the repositories listed below.
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents" ☆31 Updated 2 years ago
- A web based platform for collecting human actions in reinforcement learning environments ☆31 Updated 3 months ago
- Repo to reproduce the First-Explore paper results ☆38 Updated 11 months ago
- Generative cellular automaton-like learning environments for RL. ☆19 Updated 10 months ago
- Code for Discovered Policy Optimisation (NeurIPS 2022) ☆12 Updated 2 years ago
- GPT implementation in Flax ☆18 Updated 3 years ago
- ☆15 Updated last year
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models ☆65 Updated 9 months ago
- INTeractive learning via REPresentatIon Discovery ☆37 Updated last year
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks ☆36 Updated last year
- GBRL-based Actor-Critic algorithms implemented in stable-baselines3 ☆40 Updated last week
- PushWorld: A benchmark for manipulation planning with tools and movable obstacles ☆88 Updated last week
- ☆37 Updated 2 years ago
- ☆46 Updated last year
- Drop-in environment replacements that make your RL algorithm train faster. ☆21 Updated last year
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories ☆43 Updated 2 years ago
- ☆23 Updated last year
- Reinforcement learning in pure JAX. ☆13 Updated 9 months ago
- ☆31 Updated 3 years ago
- ☆33 Updated 4 years ago
- ☆35 Updated last year
- A tool for recording RL trajectories. ☆110 Updated 4 months ago
- ☆15 Updated 2 years ago
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" … ☆20 Updated last year
- Galactic Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-Second ☆86 Updated 2 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms ☆47 Updated 4 years ago
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection ☆46 Updated 2 years ago
- Clean RL implementation using MLX ☆33 Updated last year
- ☆19 Updated 2 years ago
- Understanding RL vision Distill article ☆24 Updated 2 years ago