alexis-jacq / numpy_ARS
A 150-line Python implementation of Augmented Random Search (https://arxiv.org/abs/1803.07055) using NumPy.
☆70Updated 5 years ago
Related projects
Alternatives and complementary repositories for numpy_ARS
- Implementation of selected reinforcement learning algorithms in TensorFlow. A3C, DDPG, REINFORCE, DQN, etc.☆151Updated last year
- Direct Future Prediction (DFP) in Keras☆109Updated 6 years ago
- Reason8.ai PyTorch solution for NIPS RL 2017 challenge☆84Updated 5 years ago
- Deep Reinforcement Learning☆17Updated 7 years ago
- A parallel version of Trust Region Policy Optimization☆65Updated 7 years ago
- An implementation of the Augmented Random Search algorithm☆421Updated 3 years ago
- Code for the paper "Learning to Act by Predicting the Future", Alexey Dosovitskiy and Vladlen Koltun, ICLR 2017☆150Updated 2 months ago
- A simple Gridworld environment for OpenAI Gym☆24Updated 6 years ago
- A reinforcement learning framework☆154Updated 5 years ago
- Using the PILCO algorithm to find a controller for a few robotic problems☆43Updated 9 years ago
- "Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow☆193Updated 6 years ago
- Shared autonomy via deep reinforcement learning☆74Updated last year
- An implementation of the A3C deep reinforcement learning method using an LSTM layer. Created with TensorFlow.☆29Updated 7 years ago
- ☆161Updated 7 years ago
- I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u…☆51Updated 4 years ago
- TensorFlow A2C to solve Acrobot, with synchronized parallel environments☆35Updated 6 years ago
- Our NIPS 2017: Learning to Run source code☆56Updated last year
- Keras implementation of DQN on ViZDoom environment☆53Updated 8 years ago
- Hindsight Experience Replay - Bit flipping experiment in TensorFlow☆58Updated 6 years ago
- Asynchronous advantage actor-critic (A3C) algorithms and Progressive Neural Networks implemented in TensorFlow.☆121Updated 8 years ago
- Trust Region Policy Optimization based on Gym and TensorFlow; can run in distributed mode☆15Updated 7 years ago
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow☆101Updated 4 years ago
- Highly Modular and Scalable Reinforcement Learning☆114Updated 4 years ago
- PyTorch implementation of Proximal Policy Optimization☆50Updated 6 years ago
- Implementation of Deepmind's Neural Episodic Control☆57Updated 6 years ago
- AI learning to walk in gym's BipedalWalker environment.☆66Updated 7 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆153Updated 7 years ago
- Combining deep learning and reinforcement learning.☆81Updated 3 years ago
- ☆99Updated 8 years ago
- PyTorch implementation of Curiosity-driven Exploration by Self-supervised Prediction☆79Updated 5 years ago