dspub99 / betazeroLinks
Tabula Rasa Tic-Tac-Toe
☆10Updated 6 years ago
Alternatives and similar repositories for betazero
Users that are interested in betazero are comparing it to the libraries listed below
Sorting:
- SafeLife: safety benchmarks for reinforcement learning agents☆61Updated 4 years ago
- presentations☆44Updated 6 years ago
- Modeling agents with probabilistic programs☆67Updated 6 years ago
- Compression algorithms (like the well-known zip file compression) can be used for machine learning purposes, specifically for classifying…☆35Updated 5 years ago
- Logarithmic Reinforcement Learning☆26Updated 2 years ago
- ☆27Updated 6 years ago
- 2019 talk at GECCO☆68Updated 6 years ago
- 3d cartpole gym env using bullet physics trained from pixels with tensorflow LRPG, DDPG & NAF☆58Updated 8 years ago
- Official implementation of the paper "Approximating two value functions instead of one: towards characterizing a new family of Deep Reinf…☆11Updated 4 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Updated 6 years ago
- Add-on package to gym, to record sequences of actions, observations, and rewards☆74Updated 2 years ago
- Non stationary bandit for experiments with Reinforcement Learning☆34Updated 8 years ago
- Online demo of DRLViz, an interactive tool to understand decisions and memory in Deep Reinforcement Learning☆15Updated 2 years ago
- Reinforcement learning in TensorFlow 2☆22Updated 3 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Updated 7 years ago
- Deep RL Bootcamp solutions☆35Updated 7 years ago
- A gym environment for Stuart Armstrong's model of a treacherous turn.☆18Updated 7 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆19Updated 5 years ago
- A Python 3 Bandit Visualization Package☆11Updated 7 years ago
- RLtime is a reinforcement learning library focused on state-of-the-art q-learning algorithms and features☆141Updated 5 years ago
- A Python library for reinforcement learning using Bayesian approaches☆54Updated 10 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Updated 5 years ago
- An interface with micropolis for city-building agents, packaged as an OpenAI gym environment☆156Updated 5 months ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆22Updated 6 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Updated 7 years ago
- Some hard problems for reinforcement learning.☆31Updated 6 years ago
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it☆128Updated 2 years ago
- Simulation of Great filter concept in Fermi Paradox using RL, GA and Tensorflow.js☆30Updated 7 years ago
- A game theory framework with examples and algorithms☆73Updated 6 years ago
- Dataset for the spaceship task from "Metacontrol for Adaptive Imagination-Based Optimization"☆56Updated 8 years ago