BenderV / PrisonerDilemma
Finding Game Theory equilibrium with machine learning agents: prisoner's dilemma
☆19 · Updated 8 years ago
Alternatives and similar repositories for PrisonerDilemma:
Users who are interested in PrisonerDilemma are comparing it to the libraries listed below.
- IRL implementation based on Norvig's AIMA code. ☆13 · Updated 10 years ago
- hierarchical deep reinforcement learning algorithms ☆41 · Updated 7 years ago
- A Python 3 Bandit Visualization Package ☆11 · Updated 7 years ago
- Fork of cma-es library by Nikolaus Hansen ☆11 · Updated 7 years ago
- ☆8 · Updated 6 years ago
- Analogous Safe-state Exploration (ASE) is an algorithm for provably safe and optimal exploration in MDPs with unknown, stochastic dynamic… ☆11 · Updated 3 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning ☆18 · Updated 7 years ago
- A Python library for reinforcement learning using Bayesian approaches ☆54 · Updated 9 years ago
- Visual Question Answering system's different implementations ☆10 · Updated 7 years ago
- Predicting sales with Pandas ☆15 · Updated 9 years ago
- ☆13 · Updated 9 years ago
- ☆11 · Updated 3 years ago
- Learning algorithms introduced in "A PAC-Bayes Sample Compression Approach to Kernel Methods" (ICML 2011) ☆9 · Updated 10 years ago
- stochs: fast stochastic solvers for machine learning in C++ and Cython ☆26 · Updated 2 years ago
- A tool for experimenting with evolutionary optimization methods for machine learning algorithms, by distributing the workload over a larg… ☆13 · Updated 6 years ago
- Python implementation of tabular asynchronous actor critic ☆11 · Updated 8 years ago
- Non stationary bandit for experiments with Reinforcement Learning ☆34 · Updated 7 years ago
- Repository for "Known Unknowns: Uncertainty Quality in Bayesian Neural Networks" paper. ☆12 · Updated 7 years ago
- Sample code for generative recurrent autoencoders. ☆25 · Updated 8 years ago
- A2C for GVG-AI ☆21 · Updated 6 years ago
- ADENINE: A Data ExploratioN PipelINE ☆15 · Updated 6 years ago
- Capturing Structure Implicitly from Noisy Time-Series having Limited Data · Updated 6 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning ☆21 · Updated 6 years ago
- Reference implementation of algorithms for reinforcement learning and Markov decision processes. ☆11 · Updated 4 years ago
- This uses HydroSphere to expose a Python machine learning preventive maintenance model for truck brake maintenance ☆15 · Updated 4 years ago
- Low-rank Highway Networks ☆13 · Updated 8 years ago
- ☆26 · Updated 5 years ago
- Markov Decision Processes in Python ☆15 · Updated 6 years ago
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning ☆30 · Updated 7 years ago
- Gibbs sampler for a Naive Bayes document classifier ☆24 · Updated 12 years ago