google-deepmind / agent_debugger
Causal Analysis of Agent Behavior for AI Safety
☆18Updated last year
Alternatives and similar repositories for agent_debugger
Users that are interested in agent_debugger are comparing it to the libraries listed below
Sorting:
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated last year
- Repo to reproduce the First-Explore paper results☆37Updated 4 months ago
- ☆31Updated 2 years ago
- GPT implementation in Flax☆18Updated 3 years ago
- ☆13Updated 10 months ago
- Drop-in environment replacements that make your RL algorithm train faster.☆20Updated 10 months ago
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆28Updated 6 months ago
- ☆44Updated 7 months ago
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated last year
- A web based platform for collecting human actions in reinforcement learning environments☆28Updated last year
- ☆19Updated 2 weeks ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆10Updated last year
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆57Updated 2 months ago
- Learn online intrinsic rewards from LLM feedback☆37Updated 4 months ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 8 months ago
- Automatically generate simple meta-learning tasks from a very large space☆15Updated last year
- GBRL-based Actor-Critic algorithms implemented in stable-baselines3☆34Updated last month
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection☆46Updated 2 years ago
- Reinforcement learning in pure JAX.☆12Updated 2 months ago
- Vintix: Action Model via In-Context Reinforcement Learning - - —☆35Updated this week
- Generative cellular automaton-like learning environments for RL.☆19Updated 3 months ago
- DreamerV3 implementation of Curious Replay, a method for prioritizing experience replay that is tailored to model-based reinforcement lea…☆36Updated last year
- PyTorch Package For Quasimetric Learning☆42Updated 6 months ago
- flexible meta-learning in jax☆13Updated last year
- INTeractive learning via REPresentatIon Discovery☆34Updated 11 months ago
- ☆36Updated 2 years ago
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)☆20Updated 8 months ago
- ☆34Updated 2 years ago
- ☆20Updated 7 months ago
- Action Value Gradient Algorithm☆20Updated last month