dspub99 / betazero
Tabula Rasa Tic-Tac-Toe
☆10Updated 6 years ago
Alternatives and similar repositories for betazero
Users that are interested in betazero are comparing it to the libraries listed below
- Explore the optimization landscape of direct policy learning in reinforcement learning.☆50Updated 6 years ago
- Logarithmic Reinforcement Learning☆26Updated 2 years ago
- Implementation of "Fast Efficient Hyperparameter Tuning for Policy Gradient Methods" (https://arxiv.org/abs/1902.06583)☆19Updated 5 years ago
- ☆27Updated 6 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆21Updated 6 years ago
- A test implementation of the IMPALA algorithm (DeepMind, 2018)☆35Updated 7 years ago
- PyTorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆95Updated 6 years ago
- An implementation of the Escape Room domain for Hierarchical Reinforcement Learning.☆25Updated 6 years ago
- Template for fast, efficient, and simple Reinforcement Learning☆37Updated 3 years ago
- A Python implementation of tabular MuZero for educational purposes☆21Updated 5 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆60Updated 4 years ago
- A simple Gridworld environment for OpenAI gym☆25Updated 6 years ago
- Decoupling Dynamics and Reward for Transfer Learning☆16Updated 6 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆31Updated 4 years ago
- ☆35Updated 6 years ago
- Great resources for learning optimal control☆17Updated 5 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments☆45Updated 4 years ago
- Online demo of DRLViz, an interactive tool to understand decisions and memory in Deep Reinforcement Learning☆15Updated 2 years ago
- Train a Bipedal Robot to walk using Reinforcement Learning☆9Updated 6 years ago
- ☆54Updated 2 years ago
- Reinforcement learning in TensorFlow 2☆22Updated 3 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Proximal Policy Optimization in PyTorch☆39Updated 7 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆49Updated 3 years ago
- Comparison of bandit algorithms from the Reinforcement Learning bible.☆17Updated 6 years ago
- A first bare-bones parallelized implementation of Go-Explore, as described in the Uber Engineering blog post☆46Updated 6 years ago
- Hierarchical deep reinforcement learning algorithms☆41Updated 7 years ago
- TD-Regularized Actor-Critic Methods☆36Updated 5 years ago
- Gridworld environments for OpenAI gym.☆80Updated last year
- Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆43Updated 2 years ago