andrewschreiber / agent
Interpretability dashboard for reinforcement learners
☆16Updated 6 years ago
Alternatives and similar repositories for agent
Users who are interested in agent are comparing it to the repositories listed below
- Training (hopefully) safe agents in gridworlds☆25Updated 6 years ago
- A gym environment for Stuart Armstrong's model of a treacherous turn.☆18Updated 7 years ago
- ☆11Updated 4 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆61Updated 4 years ago
- ☆84Updated 5 years ago
- Modeling agents with probabilistic programs☆67Updated 6 years ago
- Adapting the AlphaZero algorithm to remove the need for execution traces to train NPI.☆79Updated 2 years ago
- Scaling scaling laws with board games.☆54Updated 2 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆47Updated 4 years ago
- An environment for benchmarking commonsense agents☆29Updated 5 years ago
- An implementation of MuZero in JAX.☆58Updated 3 years ago
- The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge☆60Updated 2 years ago
- Reinforcement learning library in JAX.☆100Updated 2 years ago
- ☆53Updated 2 years ago
- Add-on package to gym, to record sequences of actions, observations, and rewards☆75Updated 2 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Updated 7 years ago
- Submissions for AI and Efficiency SOTA's☆56Updated 5 years ago
- Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks☆51Updated 3 years ago
- PyTorch implementation of OpenAI's Procgen PPO baseline, built from scratch.☆14Updated last year
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆64Updated 2 years ago
- Einsum with einops style variable names☆18Updated last year
- A simple implementation of the MuZero algorithm for the Connect 4 game☆96Updated 5 years ago
- A Python Toolkit for Managing a Large Number of Experiments☆31Updated last year
- A colab that implements the Symplectic Gradient Adjustment optimizer from "The mechanics of n-player differentiable games"☆153Updated 7 years ago
- Library for running a Monte Carlo tree search, either traditionally or with expert policies☆127Updated last year
- ☆31Updated 3 years ago
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments☆12Updated 4 years ago
- A "build to learn" Alpha Zero implementation using Gradient Boosted Decision Trees (LightGBM)☆87Updated 9 months ago
- This project was moved to: https://github.com/coax-dev/coax☆161Updated 3 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆78Updated 5 years ago