dohmatob / gambling
Code for paper "A simple algorithm for computing Nash-equilibria in incomplete information games"
☆10 · Updated 7 years ago
Related projects:
- Asynchronous One Step Q Learning implemented with MXNET ☆20 · Updated 7 years ago
- Tensorflow Implementation of Programmable Agents ☆36 · Updated 6 years ago
- Model Zoo for Deep Reinforcement Learning ☆14 · Updated 8 years ago
- Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning ☆43 · Updated 6 years ago
- Deterministic Policy Gradient using torch7 ☆44 · Updated 8 years ago
- Code to implement SIMILE algorithm from the paper entitled "Smooth Imitation Learning for Online Sequence Prediction" from ICML 2016 ☆13 · Updated 8 years ago
- Model-Free Episodic Control ☆15 · Updated 7 years ago
- MXNet implementation of Deep Q-learning ☆34 · Updated 6 years ago
- Python implementation of tabular asynchronous actor critic ☆11 · Updated 8 years ago
- Atari gauntlet for RL agents ☆29 · Updated 7 years ago
- DDPG on OpenAI Gym Pendulum ☆19 · Updated 8 years ago
- Collection of reinforcement learners implemented in python. Mainly including DQN and its variants ☆54 · Updated 7 years ago
- A very simple variant of adversarial training that yields excellent results on MNIST ☆12 · Updated 8 years ago
- reinforcement learning. policy gradient. PCL ☆38 · Updated 7 years ago
- Deep Reinforcement Learning with Fine Grained Action Repetition ☆23 · Updated 6 years ago
- Reinforcement learning with a convolutional neural network. ☆36 · Updated 9 years ago
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning ☆31 · Updated 6 years ago
- Asynchronous Advantage Actor Critic ☆21 · Updated 8 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning" ☆18 · Updated 6 years ago
- Scripts to generate a dataset with static frames from the Arcade Learning Environment ☆18 · Updated 10 years ago
- Solves AI, transcends reality, infiltrates your mind ☆36 · Updated 7 years ago
- Actor Critic using Kronecker-Factored Trust Region ☆19 · Updated 6 years ago
- Implementation of A Distributional Perspective on Reinforcement Learning ☆34 · Updated 7 years ago
- These are experiments for examining reproducibility in Policy Gradient RL algorithms in Continuous domains. Mainly using the Rllab implem… ☆18 · Updated 7 years ago