TianhongDai/distributed-ppo

TianhongDai / distributed-ppo

This is a PyTorch implementation of Distributed Proximal Policy Optimization (DPPO).

☆62

Alternatives and similar repositories for distributed-ppo

Users that are interested in distributed-ppo are comparing it to the libraries listed below.

alexis-jacq / Pytorch-DPPO
PyTorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286
☆184 · Mar 25, 2018 · Updated 8 years ago
oswsnqc / Tensorflow-DPPO
A self-written implementation of DPPO (Distributed Proximal Policy Optimization) using TensorFlow
☆12 · Sep 1, 2017 · Updated 8 years ago
seolhokim / DistributedRL-Pytorch-Ray
Distributed RL implementation using PyTorch and Ray (Ape-X, A3C, Distributed PPO (DPPO), IMPALA)
☆27 · Jun 8, 2022 · Updated 4 years ago
itsTAMART / UAV-RL-environment
A simple and fast 2D RL environment with obstacles to learn navigation.
☆23 · Sep 12, 2019 · Updated 6 years ago
LiuShuai26 / Distributed-RL
Tutorial on distributed DRL with Ray and TensorFlow.
☆10 · Dec 26, 2019 · Updated 6 years ago
UesugiErii / tf2-PPO-atari
PPO implemented in TensorFlow 2 to play Atari games
☆13 · Oct 25, 2019 · Updated 6 years ago
Jiankai-Sun / Proximal-Policy-Optimization-Pytorch
Proximal Policy Optimization (PPO) algorithm and its distributed implementation in PyTorch
☆16 · Nov 2, 2017 · Updated 8 years ago
JiahengHu / GLSO
Official implementation of GLSO: Robot Design Automation (CoRL 2022)
☆11 · Sep 21, 2022 · Updated 3 years ago
ikostrikov / pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…
☆3,905 · May 29, 2022 · Updated 4 years ago
mengf1 / DHER
DHER: Hindsight Experience Replay for Dynamic Goals (ICLR 2019)
☆65 · Nov 8, 2019 · Updated 6 years ago
alex-petrenko / signal-slot
Qt-like event loops, signals and slots for communication across threads and processes in Python
☆14 · Mar 26, 2024 · Updated 2 years ago
jsztompka / MultiAgent-PPO
Proximal Policy Optimization with a Beta distribution, applied to the multi-agent Unity ML Tennis environment
☆32 · Jan 9, 2019 · Updated 7 years ago
ycz0512 / SAC-HER
Implementation of Soft Actor-Critic with Hindsight Experience Replay
☆21 · Oct 23, 2020 · Updated 5 years ago
wotmd5731 / dqn
pytorch, noisy_distributional_double_dueling_PER_RNN_CNN...CartPole-v1, Acrobot-v1, MountainCar-v0
☆14 · Mar 19, 2018 · Updated 8 years ago
deepsense-ai / Distributed-BA3C
☆55 · Dec 7, 2022 · Updated 3 years ago
JunhongXu / ppo-pytorch
☆20 · Apr 10, 2018 · Updated 8 years ago
bsivanantham / GAE
Reinforcement learning algorithms with Generalized Advantage Estimation
☆22 · Jun 6, 2018 · Updated 8 years ago
Yuxing-Wang-THU / ModularEvoGym
A modified benchmark for designing and controlling 2D voxel-based soft robots
☆41 · Nov 18, 2023 · Updated 2 years ago
TMats / survey
Summary of Paper Survey
☆15 · Oct 16, 2019 · Updated 6 years ago
Steven-Ho / VALOR
Implementation of VALOR (Variational Option Discovery Algorithms)
☆10 · Jun 28, 2019 · Updated 7 years ago
nnaisense / MAGE
Learning Action-Value Gradients in Model-based Policy Optimization
☆32 · Sep 7, 2021 · Updated 4 years ago
xingyul / revolver
REvolveR: Continuous Evolutionary Models for Robot-to-robot Policy Transfer (ICML 2022 Long Oral)
☆27 · Sep 10, 2022 · Updated 3 years ago
lehduong / Job-Scheduling-with-Reinforcement-Learning
Learning in a noisy MDP (one governed by stochastic, exogenous input processes) with an input-dependent baseline
☆10 · Aug 7, 2020 · Updated 5 years ago
TianhongDai / self-imitation-learning-pytorch
This is the PyTorch implementation of the ICML 2018 paper Self-Imitation Learning.
☆67 · Nov 4, 2018 · Updated 7 years ago
haje01 / distper
Distributed Prioritized Experience Replay
☆10 · Aug 8, 2018 · Updated 7 years ago
deep-skill-chaining / deep-skill-chaining
Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"
☆30 · Sep 24, 2019 · Updated 6 years ago
cxxgtxy / deeprl-baselines
Deep reinforcement learning baselines based on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…
☆35 · Aug 23, 2018 · Updated 7 years ago
apexrl / autombpo
Implementation of the NeurIPS 2021 paper "On Effective Scheduling of Model-based Reinforcement Learning"
☆13 · Nov 16, 2021 · Updated 4 years ago
cxxgtxy / POP3D
Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization
☆44 · Nov 8, 2018 · Updated 7 years ago
IouJenLiu / HTS-RL
☆21 · Dec 22, 2020 · Updated 5 years ago
mklissa / PPOC
Proximal Policy Option-Critic
☆26 · Jan 4, 2019 · Updated 7 years ago
dgriff777 / a3c_continuous
A continuous action space version of A3C LSTM in PyTorch plus A3G design
☆259 · Oct 11, 2024 · Updated last year
fedingo / Hierarchical-DQN
Simple implementation of the model presented in Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic …
☆16 · Jan 22, 2019 · Updated 7 years ago
Charmve / BallPlate
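Ball-and-plate control system / rolling-ball system / BallPlate: national second-prize entry for Problem B of the 2017 National Undergraduate Electronic Design Contest
☆15 · May 27, 2024 · Updated 2 years ago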
TianhongDai / reinforcement-learning-algorithms
This repository contains PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Duel…
☆695 · Dec 18, 2025 · Updated 7 months ago
hu-po / pySACQ
PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments)
☆39 · Feb 13, 2021 · Updated 5 years ago
jerrywiston / RL-Mapless-Navigation
☆21 · Jun 7, 2020 · Updated 6 years ago
ASzot / ppo-pytorch
Proximal policy optimization in PyTorch. Easy to read and understand.
☆51 · Oct 30, 2020 · Updated 5 years ago
uidilr / ppo_tf
Implementation of proximal policy optimization (PPO) with TensorFlow
☆35 · Feb 10, 2018 · Updated 8 years ago