andrewschreiber / agentLinks
Interpretability dashboard for reinforcement learners
☆16Updated 6 years ago
Alternatives and similar repositories for agent
Users that are interested in agent are comparing it to the libraries listed below
Sorting:
- Training (hopefully) safe agents in gridworlds☆25Updated 6 years ago
- ☆12Updated 4 years ago
- A gym environment for Stuart Armstrong's model of a treacherous turn.☆18Updated 6 years ago
- Implementation of https://medium.com/ai-control/alba-an-explicit-proposal-for-aligned-ai-17a55f60bbcf☆27Updated 8 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆60Updated 4 years ago
- ☆84Updated 4 years ago
- Add-on package to gym, to record sequences of actions, observations, and rewards☆74Updated 2 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Updated 6 years ago
- Modeling agents with probabilistic programs☆67Updated 5 years ago
- The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge☆59Updated 2 years ago
- An environment for benchmarking commonsense agents☆29Updated 4 years ago
- Scaling scaling laws with board games.☆49Updated 2 years ago
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it☆128Updated last year
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated last year
- Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks☆50Updated 2 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆33Updated 3 years ago
- Implementation of Model-Agnostic Meta-Learning (MAML) in Jax☆191Updated 2 years ago
- ☆80Updated 3 years ago
- A colab that implements the Symplectic Gradient Adjustment optimizer from "The mechanics of n-player differentiable games"☆153Updated 6 years ago
- ☆9Updated 5 years ago
- The Mixing method: coordinate descent for low-rank semidefinite programming☆15Updated 4 years ago
- Language-annotated Abstraction and Reasoning Corpus☆88Updated 2 years ago
- Local experiment manager☆13Updated 5 months ago
- ☆24Updated 6 years ago
- A simple moving dot environment for OpenAI Gym to test reinforcement learning algorithms☆23Updated 2 years ago
- Modular framework for Reinforcement Learning in python☆173Updated 2 years ago
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)☆21Updated 3 years ago
- Train self-modifying neural networks with neuromodulated plasticity☆77Updated 5 years ago
- AIXIjs - General Reinforcement Learning in the Browser☆148Updated 4 years ago
- Command-line recursive question-answering with immutable contexts and explicit data store☆26Updated 6 years ago