takuseno/ppo

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/takuseno/ppo)

takuseno / ppo

Proximal Policy Optimization implementation with TensorFlow

☆108

Alternatives and similar repositories for ppo

Users that are interested in ppo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

uidilr / ppo_tf
View on GitHub
Implementation of proximal policy optimization(PPO) with tensorflow
☆35Feb 10, 2018Updated 8 years ago
takuseno / mvc-drl
View on GitHub
Cleanest deep reinforcement learning implementation based on Web MVC architecture with complete unit testings
☆12Jun 7, 2019Updated 7 years ago
hercky / ACER_tf
View on GitHub
Implementation for ACER in tensorflow and sonnet by deepmind
☆11Aug 28, 2017Updated 8 years ago
bsivanantham / GAE
View on GitHub
Reinforcement learning algorithms with Generalized Advantage Estimation
☆22Jun 6, 2018Updated 8 years ago
jw1401 / PPO-Tensorflow-2.0
View on GitHub
Proximal Policy Optimization with Tensorflow 2.0
☆32Oct 14, 2019Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
LuEE-C / PPO-Keras
View on GitHub
My implementation of the Proximal Policy Optisation algorithm using Keras as a backend
☆88Nov 15, 2019Updated 6 years ago
reinforcement-learning-kr / pg_travel
View on GitHub
Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)
☆371Aug 1, 2019Updated 6 years ago
renweiya / RFQ-RFAC
View on GitHub
Represented Value Function Approach for Large Scale Multi Agent Reinforcement Learning
☆17Mar 11, 2020Updated 6 years ago
google-research / batch-ppo
View on GitHub
Efficient Batched Reinforcement Learning in TensorFlow
☆979Jan 11, 2019Updated 7 years ago
EmbersArc / PPO
View on GitHub
PPO implementation for OpenAI gym environment based on Unity ML Agents
☆150Mar 17, 2018Updated 8 years ago
kkweon / A3C-Tensorflow
View on GitHub
Simple Example A3C Reinforcement Learning Algorithm in Tensorflow
☆13May 23, 2017Updated 9 years ago
pat-coady / trpo
View on GitHub
Trust Region Policy Optimization with TensorFlow and OpenAI Gym
☆364Jun 2, 2020Updated 6 years ago
YoongiKim / Rider-PPO
View on GitHub
Rider Reinforcement Learning Environment with Proximal Policy Optimization
☆14Sep 5, 2019Updated 6 years ago
wooridle / DeepRL-PPO-tutorial
View on GitHub
This repository contains tutorial material on Doing DeepRL with PPO in GDG DevFest 2017 Seoul.
☆22Nov 20, 2017Updated 8 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
magnusja / ppo
View on GitHub
Proximal Policy Optimization with TensorFlow and OpenAI Gym
☆19Mar 31, 2018Updated 8 years ago
liampetti / A3C-LSTM
View on GitHub
A3C-LSTM algorithm tested on CartPole OpenAI Gym environment
☆48Jul 4, 2018Updated 8 years ago
llSourcell / proximal_policy_optimization
View on GitHub
This is the code for "War Robots" by Siraj Raval on Youtube
☆16Dec 22, 2017Updated 8 years ago
kinwo / deeprl-continuous-control
View on GitHub
Learning Continuous Control in Deep Reinforcement Learning
☆14Nov 24, 2018Updated 7 years ago
alexis-jacq / Pytorch-DPPO
View on GitHub
Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286
☆184Mar 25, 2018Updated 8 years ago
facebookresearch / CollaQ
View on GitHub
A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"
☆132Aug 14, 2023Updated 2 years ago
ramp-kits / rl_simulator
View on GitHub
Model-based reinforcement learning (generative simulator models and planning agents)
☆16Mar 13, 2026Updated 4 months ago
Kyushik / DRL
View on GitHub
Repository for codes of 'Deep Reinforcement Learning'
☆218Oct 4, 2019Updated 6 years ago
pmlg / deep-rl-bootcamp
View on GitHub
Deep RL Bootcamp solutions
☆33Nov 20, 2017Updated 8 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
Anjum48 / rl-examples
View on GitHub
Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow
☆103Aug 3, 2020Updated 5 years ago
seungjaeryanlee / playing-hard-exploration-games-by-watching-youtube
View on GitHub
[WIP] Playing Hard Exploration Games by Watching YouTube (Aytar et al., 2018)
☆12Jan 31, 2019Updated 7 years ago
mihahauke / deep_rl_vizdoom
View on GitHub
Deep reinforcement learning in ViZDoom (using Tensorflow)
☆19Jan 25, 2018Updated 8 years ago
hongzimao / a3c
View on GitHub
Tensorflow implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
☆24Apr 20, 2017Updated 9 years ago
DartML / PPO-Stein-Control-Variate
View on GitHub
Proximal Policy Optimization with Stein Control Variates:
☆33Feb 12, 2018Updated 8 years ago
ASzot / ppo-pytorch
View on GitHub
Proximal policy optimization in PyTorch. Easy to read and understand.
☆51Oct 30, 2020Updated 5 years ago
phoglenix / ScExtractor
View on GitHub
High granularity and accuracy Starcraft replay data extractor which outputs to a database
☆14Feb 18, 2022Updated 4 years ago
tpbarron / pytorch-ppo
View on GitHub
Proximal Policy Optimization in PyTorch
☆39Dec 10, 2017Updated 8 years ago
roboticsleeds / mujoco-ur5-model
View on GitHub
Mujoco Model for UR5-Ridgeback-Robotiq Robot
☆48May 24, 2019Updated 7 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
p-kar / a2c-acktr-vizdoom
View on GitHub
A2C, ACKTR and A2T implementations for ViZDoom
☆10Dec 18, 2017Updated 8 years ago
jaimeyzzz / impala_horovod_gym
View on GitHub
☆10Sep 20, 2018Updated 7 years ago
slowbull / DDPG
View on GitHub
Tensorflow implementation of Deep Deterministic Policy Gradients
☆19Mar 27, 2017Updated 9 years ago
nikhilbarhate99 / PPO-PyTorch
View on GitHub
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
☆2,366Jul 9, 2024Updated 2 years ago
ChintanTrivedi / rl-bot-football
View on GitHub
An RL agent for the Google Football environment
☆95Jun 19, 2021Updated 5 years ago
mehdiboubnan / Deep-Reinforcement-Learning-applied-to-DOOM
View on GitHub
DQN, DDDQN, A3C, PPO, Curiosity applied to the game DOOM
☆94Feb 8, 2021Updated 5 years ago
awslabs / damoos
View on GitHub
DAMON-based Optimal Operation Schemes
☆17Sep 5, 2024Updated last year