integeruser / GA3C-cppLinks
A C++ implementation of the asynchronous advantage actor-critic (A3C) algorithm
☆22Updated 5 years ago
Alternatives and similar repositories for GA3C-cpp
Users that are interested in GA3C-cpp are comparing it to the libraries listed below
Sorting:
- Collection of Physics-based simulations☆68Updated 3 years ago
- OpenAI Gym environment for DART robotics simulator.☆22Updated 7 years ago
- Code for the Black-DROPS algorithm: "Black-Box Data-efficient Policy Search for Robotics", IROS 2017/ICRA 2018☆65Updated 3 years ago
- C++ implementation of Proximal Policy Optimization☆86Updated 2 years ago
- ☆184Updated 6 years ago
- OpenAI Gym environments using DART☆25Updated 2 years ago
- A Repository with C++ implementations of Reinforcement Learning Algorithms (Pytorch)☆96Updated 5 years ago
- ☆49Updated 5 years ago
- Implement A3C for Mujoco gym envs☆72Updated 7 years ago
- Experimental (stable, go here: https://github.com/benelot/pybullet-gym) repository of OpenAI Gym environments implemented with Bullet Phy…☆55Updated 3 years ago
- C++ Template Library to Predict, Control, Learn Behaviors, and Represent Learnable Knowledge using On/Off Policy Reinforcement Learning☆200Updated 8 years ago
- reimplementation of the ddpg algorithm using tensorflow☆38Updated 8 years ago
- ☆72Updated 6 years ago
- ☆54Updated 6 years ago
- trust region policy optimization base on gym and tensorflow, can run in distribution mode☆15Updated 8 years ago
- Openai Gym with Dart support☆141Updated 4 years ago
- The Winning Solution for the Learning To Run Challenge 2017☆61Updated 6 years ago
- Source code for our NIPS 2017 paper, InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations☆42Updated 7 years ago
- Tensorflow implementation of DeepMind paper - "Learning to Navigate in Complex Environments"☆63Updated 8 years ago
- Reinforcement learning algorithms with Generalized Advantage Estimation☆21Updated 7 years ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆183Updated 7 years ago
- Recurrent Network-based Deterministic Policy Gradient for Solving Bipedal Walking Challenge on Rugged Terrains☆12Updated 7 years ago
- Yet another prioritized experience replay buffer implementation.☆48Updated 2 years ago
- ☆342Updated 7 years ago
- Proximal Policy Optimization in PyTorch☆39Updated 7 years ago
- Implementation of the paper "Overcoming Exploration in Reinforcement Learning with Demonstrations" Nair et al. over the HER baselines fro…☆154Updated 3 years ago
- Code for training policies based on paper Coordinated Multi-Agent Imitation Learning☆26Updated 7 years ago
- A C++ implementation of the derivative-free optimization algorithm CMA-ES.☆23Updated 11 years ago
- Automatically exported from code.google.com/p/rl-texplore-ros-pkg☆63Updated last year
- ☆140Updated 4 years ago