rldotai / rl-algorithmsLinks
Reinforcement learning algorithms
☆41Updated 6 years ago
Alternatives and similar repositories for rl-algorithms
Users that are interested in rl-algorithms are comparing it to the libraries listed below
Sorting:
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated last year
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆77Updated 4 years ago
- Reinforcement learning algorithms in RLlib☆59Updated last year
- Bandits Environments for the OpenAI Gym☆89Updated 5 years ago
- ☆80Updated last year
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Updated 6 years ago
- PyTorch code to train and evaluate Procgen tasks☆25Updated 4 years ago
- Clone of OpenAI's Spinning Up in PyTorch☆151Updated 3 years ago
- some common TD Learning algorithms☆66Updated 5 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆63Updated last year
- SafeLife: safety benchmarks for reinforcement learning agents☆61Updated 4 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Updated 6 years ago
- ☆84Updated 4 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆87Updated 6 years ago
- Basic versions of agents from Spinning Up in Deep RL written in PyTorch☆204Updated 4 years ago
- ☆35Updated 6 years ago
- Simple tools for statistical analyses in RL experiments☆67Updated 7 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.☆36Updated 4 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Updated 4 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆67Updated 5 years ago
- Markov Decision Processes in Python☆15Updated 6 years ago
- ☆65Updated last year
- Reinforcement Learning via Latent State Decoding☆29Updated 2 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆19Updated 5 years ago
- ☆44Updated 6 years ago
- AGAC: Adversarially Guided Actor-Critic☆48Updated 3 years ago
- krazy grid world☆25Updated 5 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆149Updated 2 years ago
- Map-Elites based on Evolution Strategies☆32Updated 3 years ago
- Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability☆204Updated 4 years ago