This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).
☆63Jul 30, 2018Updated 7 years ago
Alternatives and similar repositories for distributed-ppo
Users that are interested in distributed-ppo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆184Mar 25, 2018Updated 8 years ago
- self implementation of DPPO, Distributed Proximal Policy Optimization, by using tensorflow☆12Sep 1, 2017Updated 8 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Jun 8, 2022Updated 3 years ago
- A simple and fast 2D RL environment with obstacles to learn navigation.☆23Sep 12, 2019Updated 6 years ago
- Distributed DRL by Ray and TensorFlow Tutorial.☆10Dec 26, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Proximal Policy Optimization(PPO) Algorithm and its distributed implementation in Pytorch☆16Nov 2, 2017Updated 8 years ago
- Official implementation of GLSO: Robot Design Automation (CoRL 2022)☆11Sep 21, 2022Updated 3 years ago
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.☆108Jun 16, 2022Updated 3 years ago
- Implementation of Soft Actor-Critic with Hindsight Experience Replay☆21Oct 23, 2020Updated 5 years ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…☆3,898May 29, 2022Updated 3 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆65Nov 8, 2019Updated 6 years ago
- Qt-like event loops, signals and slots for communication across threads and processes in Python☆14Mar 26, 2024Updated 2 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆31Jan 9, 2019Updated 7 years ago
- pytorch, noisy_distributional_double_dueling_PER_RNN_CNN...CartPole-v1 , Acrobot-v1, MountainCar-v0☆14Mar 19, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Trust Region Policy Optimization (TRPO) in pure TensorFlow☆18Jun 7, 2018Updated 7 years ago
- A continuous action space version of A3C LSTM in pytorch plus A3G design☆259Oct 11, 2024Updated last year
- ☆55Dec 7, 2022Updated 3 years ago
- ☆20Apr 10, 2018Updated 8 years ago
- A modified benchmark for designing and controlling 2D Voxel-based Soft Robots☆39Nov 18, 2023Updated 2 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- Summary of Paper Survey☆15Oct 16, 2019Updated 6 years ago
- Implementation of VALOR (Variational Option Discovery Algorithms)☆10Jun 28, 2019Updated 6 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Learning in Noisy MDP (which is governed by stochastic, exogenous input processes) with input-dependent baseline☆11Aug 7, 2020Updated 5 years ago
- This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.☆67Nov 4, 2018Updated 7 years ago
- Reinforcement learning algorithms with Generalized Advantage Estimation☆22Jun 6, 2018Updated 7 years ago
- Multi-Agent Reinforcement Learning for Drones☆17May 14, 2022Updated 3 years ago
- Distributed Priortized Experience Replay☆10Aug 8, 2018Updated 7 years ago
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆35Aug 23, 2018Updated 7 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆30Sep 24, 2019Updated 6 years ago
- REvolveR: Continuous Evolutionary Models for Robot-to-robot Policy Transfer (ICML 2022 Long Oral)☆27Sep 10, 2022Updated 3 years ago
- Implementation of NeurIPS2021 paper <On Effective Scheduling of Model-based Reinforcement Learning>☆13Nov 16, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Proximal Policy Option-Critic☆26Jan 4, 2019Updated 7 years ago
- Cornell House Agent Learning Environment☆47Jun 22, 2022Updated 3 years ago
- Supplementary Material "Modeling and Control of Morphing Covers for the Adaptive Morphology of Humanoid Robots" published in IEEE Transac…☆13Dec 9, 2024Updated last year
- ☆21Dec 22, 2020Updated 5 years ago
- Simple implementation of the model presented in Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic …☆16Jan 22, 2019Updated 7 years ago
- ☆13Mar 18, 2024Updated 2 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Nov 8, 2018Updated 7 years ago