mhubii / ppo_libtorch
C++ implementation of Proximal Policy Optimization
☆81, updated 2 years ago
Alternatives and similar repositories for ppo_libtorch:
Users who are interested in ppo_libtorch compare it to the libraries listed below.
- ☆49, updated 5 years ago
- Code for the Black-DROPS algorithm: "Black-Box Data-efficient Policy Search for Robotics", IROS 2017/ICRA 2018 ☆65, updated 3 years ago
- Shared autonomy via deep reinforcement learning ☆78, updated 2 years ago
- ☆55, updated 2 years ago
- A Repository with C++ implementations of Reinforcement Learning Algorithms (Pytorch) ☆95, updated 5 years ago
- Reinforcement Learning Assembly ☆92, updated 3 years ago
- Code for "Divide-and-Conquer Reinforcement Learning" ☆61, updated 6 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model ☆150, updated 4 years ago
- This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay" ☆189, updated 6 years ago
- Augmented environments with RL ☆103, updated 6 years ago
- My reading list for model-based control ☆156, updated 6 years ago
- A library of probabilistic model based RL algorithms in pytorch ☆107, updated 4 years ago
- Convert DeepMind Control Suite to OpenAI gym environments. ☆86, updated 5 years ago
- This repo is intended as an extension for OpenAI Gym for auxiliary tasks (multitask learning, transfer learning, inverse reinforcement le… ☆215, updated 5 years ago
- Autogenic differentiation ☆51, updated 2 years ago
- CuLE: A CUDA port of the Atari Learning Environment (ALE) ☆237, updated 2 years ago
- Baselines and memory-based scenarios for the ViZDoom simulator ☆34, updated 2 years ago
- ☆70, updated 5 years ago
- A C++ implementation of the asynchronous advantage actor-critic (A3C) algorithm ☆22, updated 5 years ago
- Implement A3C for Mujoco gym envs ☆72, updated 7 years ago
- A C++/Python simulator package for reinforcement learning ☆85, updated 6 years ago
- ☆54, updated 7 years ago
- Reinforcement learning algorithms with Generalized Advantage Estimation ☆21, updated 6 years ago
- NIPS 2017 Value Prediction Network ☆166, updated 7 years ago
- Plot Tensorflow Summary Event in a Beautiful Way 🌈 ☆68, updated 6 years ago
- The Differentiable Cross-Entropy Method ☆126, updated 4 years ago
- ☆68, updated 3 years ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286 ☆183, updated 7 years ago
- ☆47, updated 4 years ago
- OpenAI Gym environment for DART robotics simulator. ☆22, updated 7 years ago