andrewschreiber / agentLinks
Interpretability dashboard for reinforcement learners
☆16Updated 6 years ago
Alternatives and similar repositories for agent
Users that are interested in agent are comparing it to the libraries listed below
Sorting:
- Training (hopefully) safe agents in gridworlds☆25Updated 6 years ago
- A gym environment for Stuart Armstrong's model of a treacherous turn.☆18Updated 7 years ago
- ☆12Updated 4 years ago
- ☆84Updated 4 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Updated 7 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆61Updated 4 years ago
- Implementation of https://medium.com/ai-control/alba-an-explicit-proposal-for-aligned-ai-17a55f60bbcf☆27Updated 8 years ago
- Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks☆51Updated 2 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆33Updated 3 years ago
- The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge☆60Updated 2 years ago
- Add-on package to gym, to record sequences of actions, observations, and rewards☆75Updated 2 years ago
- Modeling agents with probabilistic programs☆67Updated 6 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated 2 years ago
- An environment for benchmarking commonsense agents☆29Updated 5 years ago
- Reinforcement learning library in JAX.☆100Updated last year
- Scaling scaling laws with board games.☆53Updated 2 years ago
- Paired Open-Ended Trailblazer (POET) and Enhanced POET☆253Updated 3 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆64Updated 2 years ago
- A collection of papers on divergence and quality diversity☆77Updated 3 years ago
- An implementation of MuZero in JAX.☆57Updated 2 years ago
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments☆12Updated 4 years ago
- Algorithmic Intelligence Quotient☆39Updated 3 years ago
- Baselines for gymnax 🤖☆72Updated 2 years ago
- Standard interface for entity based reinforcement learning environments.☆38Updated last year
- PAIRED in PyTorch 🔥☆63Updated 2 years ago
- A simple implementation of MuZero algorithm for connect4 game☆96Updated 5 years ago
- General Modules for JAX☆67Updated 3 weeks ago
- Train self-modifying neural networks with neuromodulated plasticity☆78Updated 5 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆77Updated 5 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 4 years ago