google-deepmind / agent_debugger

Causal Analysis of Agent Behavior for AI Safety

☆18

Alternatives and similar repositories for agent_debugger

Users who are interested in agent_debugger are comparing it to the libraries listed below.

Farama-Foundation / CrowdPlay
A web-based platform for collecting human actions in reinforcement learning environments
☆30 Updated last year
CEC-Agent / CEC
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"
☆31 Updated last year
btnorman / First-Explore
Repo to reproduce the First-Explore paper results
☆37 Updated 6 months ago
google-deepmind / enn_acme
☆31 Updated 2 years ago
brentyi / minGPT-flax
GPT implementation in Flax
☆18 Updated 3 years ago
FLAIROx / cultural-accumulation
☆12 Updated last year
danijar / granular
Fast dataset format and loader
☆22 Updated 6 months ago
keraJLi / synthetic-gymnax
Drop-in environment replacements that make your RL algorithm train faster.
☆21 Updated last year
facebookresearch / macta
MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection
☆46 Updated 2 years ago
facebookresearch / ssorl
Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories
☆42 Updated 2 years ago
NVlabs / gbrl_sb3
GBRL-based Actor-Critic algorithms implemented in stable-baselines3
☆35 Updated 2 weeks ago
ml-jku / LRAM
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
☆33 Updated 8 months ago
google-deepmind / csuite
☆44 Updated 9 months ago
google-deepmind / tell_me_why_explanations_rl
☆36 Updated 2 years ago
rail-berkeley / SUPE
This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."
☆28 Updated this week
distillpub / post--understanding-rl-vision
Understanding RL vision Distill article
☆23 Updated 2 years ago
smearle / autoverse
Generative cellular automaton-like learning environments for RL.
☆19 Updated 5 months ago
changchencc / PlanDQ
PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer
☆10 Updated last year
facebookresearch / oni
Learn online intrinsic rewards from LLM feedback
☆41 Updated 7 months ago
ThomasMiconi / Meta-Task-Generator
Automatically generate simple meta-learning tasks from a very large space
☆15 Updated last year
google-deepmind / envlogger
A tool for recording RL trajectories.
☆103 Updated 8 months ago
princeton-nlp / lwm
We develop world models that can be adapted with natural language. Integrating these models into artificial agents allows humans to effe…
☆24 Updated last year
Farama-Foundation / Jumpy
On-the-fly conversions between Jax and NumPy tensors
☆52 Updated 2 years ago
EleutherAI / mdl
Minimum Description Length probing for neural network representations
☆18 Updated 5 months ago
luchris429 / discovered-policy-optimisation
Code for Discovered Policy Optimisation (NeurIPS 2022)
☆11 Updated 2 years ago
abaheti95 / LoL-RL
Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients
☆26 Updated 10 months ago
arnavkj1995 / SFM
Official Implementation of SFM and the baselines in Jax.
☆19 Updated last month
dunnolab / vintix
Vintix: Action Model via In-Context Reinforcement Learning (ICML 2025)
☆42 Updated last month
IBM / NL2PDDL
this is for fun, ain't it grand!
☆20 Updated 2 months ago
facebookresearch / ExPLORe
This is code to accompany the paper "Accelerating Exploration with Unlabeled Prior Data".
☆24 Updated last year