google-deepmind / agent_debuggerLinks
Causal Analysis of Agent Behavior for AI Safety
☆18Updated 2 years ago
Alternatives and similar repositories for agent_debugger
Users that are interested in agent_debugger are comparing it to the libraries listed below
Sorting:
- A web based platform for collecting human actions in reinforcement learning environments☆30Updated last year
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated last year
- Repo to reproduce the First-Explore paper results☆37Updated 6 months ago
- ☆31Updated 2 years ago
- GPT implementation in Flax☆18Updated 3 years ago
- ☆12Updated last year
- Fast dataset format and loader☆22Updated 6 months ago
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection☆46Updated 2 years ago
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated 2 years ago
- GBRL-based Actor-Critic algorithms implemented in stable-baselines3☆35Updated 2 weeks ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆33Updated 8 months ago
- ☆44Updated 9 months ago
- ☆36Updated 2 years ago
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆28Updated this week
- Understanding RL vision Distill article☆23Updated 2 years ago
- Generative cellular automaton-like learning environments for RL.☆19Updated 5 months ago
- PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer☆10Updated last year
- Learn online intrinsic rewards from LLM feedback☆41Updated 7 months ago
- Automatically generate simple meta-learning tasks from a very large space☆15Updated last year
- A tool for recording RL trajectories.☆103Updated 8 months ago
- We develop world models that can be adapted with natural language. Intergrating these models into artificial agents allows humans to effe…☆24Updated last year
- On-the-fly conversions between Jax and NumPy tensors☆52Updated 2 years ago
- Minimum Description Length probing for neural network representations☆18Updated 5 months ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆11Updated 2 years ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 10 months ago
- Official Implementation of SFM and the baselines in Jax.☆19Updated last month
- Vintix: Action Model via In-Context Reinforcement Learning - - — ICML 2025☆42Updated last month
- this is for fun, ain't it grand!☆20Updated 2 months ago
- This is code to accompany the paper "Accelerating Exploration with Unlabeled Prior Data".☆24Updated last year