dspub99 / betazero
Tabula Rasa Tic-Tac-Toe
☆10Updated 5 years ago
Related projects
Alternatives and complementary repositories for betazero
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 5 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Updated 6 years ago
- Logarithmic Reinforcement Learning☆26Updated last year
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆21Updated 6 years ago
- Non stationary bandit for experiments with Reinforcement Learning☆34Updated 7 years ago
- Reinforcement Learning via Latent State Decoding☆30Updated last year
- Comparison of bandit algorithms from the Reinforcement Learning bible.☆17Updated 6 years ago
- ☆26Updated 5 years ago
- Reinforcement learning in TensorFlow 2☆22Updated 2 years ago
- Official implementation of the paper "Approximating two value functions instead of one: towards characterizing a new family of Deep Reinf…☆12Updated 3 years ago
- A simple Gridworld environment for OpenAI Gym☆24Updated 6 years ago
- A Python implementation of tabular MuZero for educational purposes☆21Updated 4 years ago
- Implementation of Learning Continuous Control Policies by Stochastic Value Gradients (https://arxiv.org/abs/1510.09142)☆25Updated 2 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆17Updated 7 years ago
- Implementation of Fast Efficient Hyperparameter Tuning for Policy Gradient Methods (https://arxiv.org/abs/1902.06583)☆17Updated 5 years ago
- Reinforcement learning algorithms☆40Updated 5 years ago
- Implementation of the Receding Horizon Curiosity algorithm☆13Updated last year
- SafeLife: safety benchmarks for reinforcement learning agents☆59Updated 3 years ago
- A first bare-bones parallelized implementation of Go-Explore as described in the Uber Engineering blog post☆45Updated 5 years ago
- Modeling agents with probabilistic programs☆66Updated 5 years ago
- Online demo of DRLViz, an interactive tool to understand decisions and memory in Deep Reinforcement Learning☆15Updated last year
- Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks☆48Updated last year
- PyTorch-based Python library for continuous reinforcement learning and imitation learning [superseded by @osudrl/apex]☆13Updated 4 years ago
- Simple tools for statistical analyses in RL experiments☆66Updated 6 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Updated 3 years ago
- Template for fast, efficient, and simple Reinforcement Learning☆37Updated 3 years ago
- hierarchical deep reinforcement learning algorithms☆41Updated 6 years ago
- TensorFlow A2C to solve Acrobot, with synchronized parallel environments☆35Updated 6 years ago
- Safe Reinforcement Learning algorithms☆70Updated 2 years ago