JannesKlaas / sometimes_deep_sometimes_learning
A collection of DL experiments and notes
☆135 · Updated 5 years ago
Related projects
Alternatives and complementary repositories for sometimes_deep_sometimes_learning
- This is the code for "How Does DeepMind's AlphaGo Zero Work?" by Siraj Raval on YouTube ☆122 · Updated 7 years ago
- Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras ☆159 · Updated 4 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package ☆267 · Updated 5 years ago
- Simplest Version of playing Atari with Deep Q Learning in TensorFlow ☆160 · Updated 7 years ago
- Reinforcement Learning in Python ☆107 · Updated 4 years ago
- Reinforcement Learning with Goals ☆170 · Updated 5 years ago
- Contains Jupyter notebooks associated with the "Deep Reinforcement Learning Tutorial" tutorial given at the O'Reilly 2017 NYC AI Conferen… ☆273 · Updated 4 years ago
- Implementations of deep RL papers and random experimentation ☆177 · Updated 6 years ago
- random search, hill climbing, policy gradient ☆140 · Updated 6 years ago
- Our team's code for the OpenAI Retro Contest on reinforcement learning ☆99 · Updated 6 years ago
- Using Keras and Deep Q-Network to Play FlappyBird ☆437 · Updated 5 years ago
- This code is written for the blogs ☆271 · Updated 8 years ago
- Deep RL Algorithms implemented for UC Berkeley's CS 294-112: Deep Reinforcement Learning ☆140 · Updated 7 years ago
- DQN implementation in Keras + TensorFlow + OpenAI Gym ☆158 · Updated 6 years ago
- Reinforcement Learning and Transfer Learning based StarCraft Micromanagement ☆101 · Updated 7 years ago
- Deep reinforcement learning using an asynchronous advantage actor-critic (A3C) model. ☆66 · Updated 6 years ago
- A TensorFlow-based implementation of the DeepMind Atari playing "Deep Q Learning" agent that works reasonably well ☆91 · Updated 7 years ago
- ☆215 · Updated 7 years ago
- An experimentation framework for Reinforcement Learning using OpenAI Gym, TensorFlow, and Keras. ☆326 · Updated 6 years ago
- This is the code for "How to Beat Pong Using Policy Gradients (LIVE)" by Siraj Raval on YouTube ☆62 · Updated 7 years ago
- ☆39 · Updated 7 years ago
- Accompanying repository for the Let's make a DQN / A3C series. ☆391 · Updated 6 years ago
- A series of deep reinforcement learning algorithms, such as DQN, DDQN, one-step-DQN, and DDPG ☆41 · Updated 8 years ago
- Code that accompanies my talk at TF Dev Summit 2016 ☆337 · Updated 6 years ago
- Reinforcement learning with unsupervised auxiliary tasks ☆416 · Updated 5 years ago
- Jupyter notebook running through basic examples of Distributed TensorFlow ☆71 · Updated 3 years ago
- This is the code for "How to Learn from Little Data - Intro to Deep Learning #17" by Siraj Raval on YouTube ☆141 · Updated 7 years ago
- ☆77 · Updated 7 years ago
- TensorFlow + OpenAI Gym implementation of Deep Q-Network (DQN), Double DQN (DDQN), Dueling Network and Deep Deterministic Policy Gradient… ☆75 · Updated 7 years ago
- ☆53 · Updated 7 years ago