MorvanZhou / my_research
机器学习的研究
☆36Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for my_research
- Deep reinforcement learning agents implement by tensorflow https://ghli.org☆54Updated 5 years ago
- Reinforcement learning with docker and torcs☆20Updated 7 years ago
- Deep Reinforcement Learning in Autonomous Driving: the A3C algorithm used to make a car learn to drive in TORCS; Python 3.5, Tensorflow, …☆36Updated 6 years ago
- PyTorch implementation of "Asynchronous advantage actor-critic"☆21Updated 6 years ago
- Minimal implementations of reinforcement learning algorithms by Tensorflow☆29Updated 6 years ago
- ☆53Updated 7 years ago
- BMVC 2017: Virtual to Real Reinforcement Learning for Autonomous Driving☆44Updated 5 years ago
- 2048 playing agent using deep Q-learning in Matlab.☆38Updated 8 years ago
- reimplementation of the ddpg algorithm using tensorflow☆38Updated 8 years ago
- PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch☆114Updated 7 years ago
- Deep Recurrent Attention Reinforcement Learning in Atari☆82Updated 6 years ago
- reinforcement learning ddpg code. follow deepmind papers.☆60Updated 6 years ago
- This is the code for the "How to Beat Pong Using Policy Gradients (LIVE)" by Siraj Raval on Youtube☆62Updated 7 years ago
- TensorFlow A2C to solve Acrobot, with synchronized parallel environments☆35Updated 6 years ago
- advantage actor-critic reinforcement learning for openai gym cartpole☆64Updated 7 years ago
- My solutions to Berkeley's CS294 (Deep Reinforcement Learning) Homework☆36Updated 6 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆56Updated 7 years ago
- Using Pilco algorithm to find a controller for few robotic problems☆43Updated 9 years ago
- ☆408Updated 6 years ago
- Jointly learning policies and latent representations for driver behavior.☆15Updated 7 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.☆151Updated last year
- ☆117Updated 4 years ago
- A Simple Example for Imitation Learning with Dataset Aggregation (DAGGER) on Torcs Env☆71Updated 7 years ago
- ☆11Updated last year
- Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras☆32Updated 8 years ago
- just for fun☆23Updated 7 years ago
- "Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow☆193Updated 6 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments☆44Updated 4 years ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆180Updated 6 years ago
- Reinforcement Learning in Python☆107Updated 4 years ago