llSourcell / alphago_demo
This is the code for "How Does DeepMind's AlphaGo Zero Work?" by Siraj Raval on YouTube
☆122Updated 6 years ago
Related projects:
- Unofficial attempt to rebuild AlphaGo Zero☆57Updated 4 months ago
- Congratulations to DeepMind! This is a reengineering implementation (on behalf of many other git repos in /support/) of DeepMind's Oct19th …☆340Updated last year
- Neural Networks For Playing Pong☆77Updated 8 years ago
- AlphaGo-paper☆54Updated 5 years ago
- Minimalistic AlphaGoZero-like Engine☆275Updated 6 years ago
- BetaGo: AlphaGo for the masses, live on GitHub.☆679Updated 3 years ago
- This is the code for "A Guide to DeepMind's StarCraft AI Environment" by Siraj Raval on Youtube☆211Updated 3 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆265Updated 4 years ago
- A student implementation of Alpha Go Zero☆276Updated 6 years ago
- This is the Code for "Deep Q Learning - The Math of Intelligence #9" By Siraj Raval on Youtube☆162Updated 6 years ago
- A TensorFlow based implementation of the DeepMind Atari playing "Deep Q Learning" agent that works reasonably well☆91Updated 7 years ago
- Game AI for Machine Learning for Hackers #3☆156Updated 7 years ago
- Codes of our team for the OpenAI Retro Contest of reinforcement learning☆100Updated 6 years ago
- Reinforcement Learning with Goals☆170Updated 4 years ago
- A collection of DL experiments and notes☆135Updated 5 years ago
- Contains Jupyter notebooks associated with the "Deep Reinforcement Learning Tutorial" tutorial given at the O'Reilly 2017 NYC AI Conferen…☆273Updated 4 years ago
- This is the code for "How to Learn from Little Data - Intro to Deep Learning #17" by Siraj Raval on YouTube☆141Updated 7 years ago
- Applying the deep learning techniques from Alpha Go to play tic-tac-toe☆162Updated 6 years ago
- ☆94Updated this week
- Using Keras and Deep Q-Network to Play FlappyBird☆434Updated 5 years ago
- Reversi reinforcement learning by AlphaGo Zero methods.☆676Updated last year
- Connect4 reinforcement learning by AlphaGo Zero methods.☆114Updated 3 years ago
- An experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.☆326Updated 6 years ago
- Deep Reinforcement Learning library for humans☆300Updated 7 years ago
- Implementations of deep RL papers and random experimentation☆177Updated 6 years ago
- This is the code for "Synthetic Gradients Explained" by Siraj Raval on Youtube☆61Updated 6 years ago
- random search, hill climbing, policy gradient☆138Updated 6 years ago
- This is the code for "Generative Artificial Intelligence" By Siraj Raval on Youtube☆102Updated 6 years ago
- This is the code for "AI that Creates AI" By Siraj Raval on Youtube☆66Updated 6 years ago
- ☆112Updated 7 years ago