andrewschreiber / agentLinks
Interpretability dashboard for reinforcement learners
☆16Updated 6 years ago
Alternatives and similar repositories for agent
Users that are interested in agent are comparing it to the libraries listed below
Sorting:
- Training (hopefully) safe agents in gridworlds☆25Updated 6 years ago
- ☆11Updated 4 years ago
- A gym environment for Stuart Armstrong's model of a treacherous turn.☆18Updated 7 years ago
- Modeling agents with probabilistic programs☆67Updated 6 years ago
- Implementation of https://medium.com/ai-control/alba-an-explicit-proposal-for-aligned-ai-17a55f60bbcf☆27Updated 8 years ago
- ☆84Updated 5 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated 2 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆61Updated 4 years ago
- Language-annotated Abstraction and Reasoning Corpus☆95Updated 2 years ago
- Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks☆51Updated 2 years ago
- Reinforcement learning library in JAX.☆100Updated 2 years ago
- Scaling scaling laws with board games.☆53Updated 2 years ago
- The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge☆60Updated 2 years ago
- AIXIjs - General Reinforcement Learning in the Browser☆147Updated 5 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Updated 7 years ago
- A toy model of Friston's active inference in Tensorflow☆41Updated 8 years ago
- An environment for benchmarking commonsense agents☆29Updated 5 years ago
- A "build to learn" Alpha Zero implementation using Gradient Boosted Decision Trees (LightGBM)☆86Updated 8 months ago
- Add-on package to gym, to record sequences of actions, observations, and rewards☆76Updated 2 years ago
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it☆129Updated 2 years ago
- Reinforcement learning algorithms☆41Updated 6 years ago
- ☆52Updated 2 years ago
- A programming language for formal/informal computation.☆41Updated 3 months ago
- The Mixing method: coordinate descent for low-rank semidefinite programming☆15Updated 4 years ago
- Probabilistic Programming eXecution protocol (PPX)☆76Updated 3 years ago
- Code for the Neural GPU model originally described in "Neural GPUs Learn Algorithms"☆145Updated 7 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆33Updated 3 years ago
- A collection of papers on divergence and quality diversity☆78Updated 3 years ago
- This project was moved to: https://github.com/coax-dev/coax☆160Updated 2 years ago
- A Python library for defining structured command-line flags.☆32Updated 4 months ago