dspub99 / betazero
Tabula Rasa Tic-Tac-Toe
☆10Updated 6 years ago
Alternatives and similar repositories for betazero:
Users that are interested in betazero are comparing it to the libraries listed below
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 6 years ago
- Logarithmic Reinforcement Learning☆26Updated last year
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆21Updated 6 years ago
- PyTorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆93Updated 6 years ago
- Official implementation of the paper "Approximating two value functions instead of one: towards characterizing a new family of Deep Reinf…☆11Updated 3 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Updated 7 years ago
- hierarchical deep reinforcement learning algorithms☆41Updated 7 years ago
- Codebase of Santara et al., RAIL: Risk Averse Imitation Learning, published in AAMAS 2018☆14Updated 3 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Updated 6 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Updated 4 years ago
- Simple tools for statistical analyses in RL experiments☆66Updated 6 years ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm☆25Updated 2 years ago
- A simple Gridworld environment for OpenAI Gym☆25Updated 6 years ago
- Great resources for learning optimal control☆17Updated 5 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆48Updated 2 years ago
- Reinforcement Learning via Latent State Decoding☆30Updated last year
- Repository for "Known Unknowns: Uncertainty Quality in Bayesian Neural Networks" paper.☆12Updated 7 years ago
- A Python implementation of tabular MuZero for educational purposes☆21Updated 5 years ago
- An Empirical Analysis of Gradient Descent Optimization in Policy Gradient Methods - EWRL Workshop 2018☆15Updated 6 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning☆20Updated 7 years ago
- A first bare-bones parallelized implementation of Go-Explore as described by the Uber Engineering blog post☆46Updated 6 years ago
- Python package for inference with Gaussian processes☆11Updated 9 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments☆45Updated 4 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Updated 5 years ago
- Non-stationary bandit for experiments with Reinforcement Learning☆34Updated 7 years ago
- TensorFlow A2C to solve Acrobot, with synchronized parallel environments☆35Updated 6 years ago
- Separating value functions across time-scales.☆17Updated 5 years ago