andrewschreiber / agentLinks
Interpretability dashboard for reinforcement learners
☆16Updated 6 years ago
Alternatives and similar repositories for agent
Users that are interested in agent are comparing it to the libraries listed below
Sorting:
- Training (hopefully) safe agents in gridworlds☆25Updated 6 years ago
- A gym environment for Stuart Armstrong's model of a treacherous turn.☆18Updated 7 years ago
- ☆12Updated 4 years ago
- The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge☆59Updated 2 years ago
- ☆84Updated 4 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆61Updated 4 years ago
- An environment for benchmarking commonsense agents☆29Updated 5 years ago
- Add-on package to gym, to record sequences of actions, observations, and rewards☆74Updated 2 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated last year
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it☆128Updated 2 years ago
- Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks☆51Updated 2 years ago
- Modeling agents with probabilistic programs☆67Updated 6 years ago
- A formalisation of Cartesian Frames, a perspective on embedded agency, in the HOL theorem prover.☆19Updated 3 years ago
- Submissions for AI and Efficiency SOTA's☆57Updated 5 years ago
- Implementation of https://medium.com/ai-control/alba-an-explicit-proposal-for-aligned-ai-17a55f60bbcf☆27Updated 8 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Updated 7 years ago
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)☆22Updated 3 years ago
- Reinforcement learning library in JAX.☆100Updated last year
- Implementation of Model-Agnostic Meta-Learning (MAML) in Jax☆191Updated 2 years ago
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes☆241Updated 2 years ago
- Neural network verification in JAX☆145Updated 2 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆64Updated 2 years ago
- Functional machine learning for fun☆85Updated 4 years ago
- Basic versions of agents from Spinning Up in Deep RL written in PyTorch☆207Updated 4 years ago
- Procgen2: A community maintained fork of procgen☆11Updated 3 years ago
- Redwood Research's transformer interpretability tools☆14Updated 3 years ago
- Scaling scaling laws with board games.☆53Updated 2 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 4 years ago
- Reinforcement Learning Assembly☆92Updated 4 years ago
- A colab that implements the Symplectic Gradient Adjustment optimizer from "The mechanics of n-player differentiable games"☆153Updated 6 years ago