andrewschreiber / agent
Interpretability dashboard for reinforcement learners
☆16Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for agent
- Training (hopefully) safe agents in gridworlds☆25Updated 5 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆59Updated 3 years ago
- A gym environment for Stuart Armstrong's model of a treacherous turn.☆17Updated 6 years ago
- An environment for benchmarking commonsense agents☆28Updated 4 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆78Updated last year
- Implementation of https://medium.com/ai-control/alba-an-explicit-proposal-for-aligned-ai-17a55f60bbcf☆27Updated 7 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Updated 6 years ago
- ☆85Updated 4 years ago
- Library that provides environments for planning problems☆15Updated last month
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆90Updated 6 years ago
- Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks☆48Updated last year
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"☆83Updated 4 years ago
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it☆128Updated last year
- PAIRED in PyTorch 🔥☆56Updated last year
- Library to compare and evaluate reward functions☆61Updated last year
- The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge☆58Updated last year
- A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release th…☆201Updated 4 years ago
- Redwood Research's transformer interpretability tools☆12Updated 2 years ago
- Code to accompany "Human-Level Performance in No-Press Diplomacy via Equilibrium Search", published at ICLR 2021☆45Updated 2 years ago
- An implementation of MuZero in JAX.☆53Updated 2 years ago
- Modeling agents with probabilistic programs☆66Updated 5 years ago
- Some hard problems for reinforcement learning.☆32Updated 6 years ago
- Baselines for gymnax 🤖☆60Updated last year
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)☆21Updated 2 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Fully differentiable RL environments, written in Ivy.☆63Updated last year
- Opinionated library for managing hyperparameters and mutable state of machine learning training systems.☆19Updated last year
- Scaling scaling laws with board games.☆43Updated last year
- Vectorization techniques for fast population-based training.☆54Updated 2 years ago
- Add-on package to gym, to record sequences of actions, observations, and rewards☆72Updated last year