rltheorybook / rltheorybook.github.io
☆27Updated 2 years ago
Related projects: ⓘ
- Library to compare and evaluate reward functions☆61Updated 10 months ago
- Reinforcement learning algorithms☆39Updated 5 years ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆77Updated 11 months ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.☆35Updated 3 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆59Updated 3 years ago
- PAIRED in PyTorch 🔥☆56Updated last year
- Progress, Notes, Summaries and a lot of Questions on Machine Learning☆55Updated 4 years ago
- Reinforcement learning library in JAX.☆102Updated 10 months ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆90Updated 6 years ago
- ☆80Updated 11 months ago
- A job launching library for docker, EC2, GCP, etc.☆57Updated 3 years ago
- ☆85Updated 3 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"☆83Updated 4 years ago
- Collection of in-progress libraries for entity neural networks.☆29Updated 2 years ago
- Open-source library for a reinforcement learning research.☆54Updated last year
- impact-driven-exploration☆125Updated 11 months ago
- Map-Elites based on Evolution Strategies☆31Updated 2 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆85Updated 5 years ago
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.☆121Updated 3 weeks ago
- Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks☆47Updated last year
- The Balloon Learning Environment - flying stratospheric balloons with deep reinforcement learning.☆119Updated 2 months ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 5 years ago
- The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge☆54Updated last year
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆26Updated 4 years ago
- ☆28Updated 2 years ago
- Augmented environments with RL☆102Updated 5 years ago
- Reinforcement Learning via Latent State Decoding☆30Updated last year