georgwiese / 2048-rl
2048 Reinforcement Learning
☆50Updated 6 years ago
Alternatives and similar repositories for 2048-rl:
Users that are interested in 2048-rl are comparing it to the libraries listed below
- A Deep Learning AI for 2048 (2048: 94.15%, 4096: 78.48%, 8192: 34.5%, 16384: 0.177%)☆150Updated 6 years ago
- Reinforcement Learning for Super Mario Bros using A3C on GPU☆37Updated 7 years ago
- This is the code for "How Does DeepMind's AlphaGo Zero Work?" by Siraj Raval on YouTube☆122Updated 7 years ago
- Our team's code for the OpenAI Retro Contest on reinforcement learning☆99Updated 6 years ago
- Simplest Version of playing Atari with Deep Q Learning in Tensorflow☆158Updated 7 years ago
- This is the code for "How to Beat Pong Using Policy Gradients (LIVE)" by Siraj Raval on YouTube☆68Updated 8 years ago
- A Policy Network in Tensorflow to classify chess moves☆17Updated 8 years ago
- Use Asynchronous advantage actor-critic algorithm (A3C) to play Flappy Bird using Keras☆39Updated 7 years ago
- 2nd place solution of NIPS2017 LearningToRun Competition.☆124Updated 2 years ago
- ☆39Updated 7 years ago
- A TensorFlow based implementation of the DeepMind Atari playing "Deep Q Learning" agent that works reasonably well☆92Updated 7 years ago
- Add-on for OpenAI Gym that supports automatic downloading of user environments.☆45Updated 7 years ago
- Tensorflow implementation of SqueezeNet.☆129Updated 6 years ago
- PyTorch implementation of "Asynchronous advantage actor-critic"☆19Updated 6 years ago
- Unofficial attempt to rebuild AlphaGo Zero☆58Updated 11 months ago
- Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras☆31Updated 8 years ago
- A collection of DL experiments and notes☆135Updated 6 years ago
- alphagomoku☆60Updated 7 years ago
- An implementation of improved AlphaGo algorithm in the game of Gomoku.☆57Updated 5 years ago
- Connect4 reinforcement learning by AlphaGo Zero methods.☆114Updated 4 years ago
- Deep reinforcement learning baselines based on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆35Updated 6 years ago
- Tensorflow implementation of A3C algorithm☆46Updated 7 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆266Updated 5 years ago
- Neural Networks For Playing Pong☆77Updated 8 years ago
- Chess position evaluation using neural networks☆26Updated 5 years ago
- An implementation of the AlphaZero algorithm for chess☆33Updated 2 years ago
- A Renju game that replicates the paper "Mastering the game of Go with deep neural networks and tree search"☆20Updated 8 years ago
- ☆53Updated 8 years ago
- 💡 Repo of learning notes in DRL and DL, theory, codes, models and notes maybe.☆102Updated 6 years ago
- Reinforcement learning toolbox, contains TRPO and A3C algorithms for continuous action spaces☆42Updated 7 years ago