mhubii / ppo_libtorch
C++ implementation of Proximal Policy Optimization
☆73Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for ppo_libtorch
- ☆47Updated 4 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard☆103Updated 5 years ago
- [ICLR 2018] Tensorflow/Keras code for Semi-parametric Topological Memory for Navigation☆103Updated 5 years ago
- Baselines and memory-based scenarios for the ViZDoom simulator☆33Updated last year
- My reading list for model-based control☆150Updated 5 years ago
- Plot Tensorflow Summary Event in a Beautiful Way 🌈☆68Updated 5 years ago
- ☆71Updated 5 years ago
- Machine Learning Course Project Skoltech 2018☆108Updated 5 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model☆149Updated 4 years ago
- Highly Modular and Scalable Reinforcement Learning☆114Updated 4 years ago
- Implementation of the Deep Deterministic Policy Gradient(DDPG) in bullet Gym using pytorch☆41Updated 6 years ago
- ☆32Updated 6 years ago
- Collection of Physics-based simulations☆66Updated 2 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆94Updated 4 years ago
- Code for the Black-DROPS algorithm: "Black-Box Data-efficient Policy Search for Robotics", IROS 2017/ICRA 2018☆64Updated 3 years ago
- ☆71Updated 2 years ago
- Augmented environments with RL☆102Updated 5 years ago
- ☆53Updated 6 years ago
- A Simple Example for Imitation Learning with Dataset Aggregation (DAGGER) on Torcs Env☆71Updated 7 years ago
- Proximal Policy Optimization in PyTorch☆38Updated 6 years ago
- Guided-Meta Policy Search☆41Updated last year
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow☆101Updated 4 years ago
- This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.☆66Updated 6 years ago
- ☆29Updated 7 years ago
- ☆68Updated 3 years ago
- Convert DeepMind Control Suite to OpenAI gym environments.☆83Updated 4 years ago
- ☆53Updated 2 years ago
- A C++ implementation of the asynchronous advantage actor-critic (A3C) algorithm☆22Updated 4 years ago
- PyTorch implementation of Proximal Policy Optimization☆50Updated 6 years ago