uidilr/ppo_tf

Implementation of proximal policy optimization (PPO) with TensorFlow

☆35

Alternatives and similar repositories for ppo_tf

Users who are interested in ppo_tf are comparing it to the libraries listed below.

takuseno / ppo
Proximal Policy Optimization implementation with TensorFlow
☆108 · Oct 9, 2018 · Updated 7 years ago
shareeff / PPO
TensorFlow implementation of the proximal policy optimization (PPO) algorithm
☆13 · Feb 28, 2018 · Updated 8 years ago
magnusja / ppo
Proximal Policy Optimization with TensorFlow and OpenAI Gym
☆19 · Mar 31, 2018 · Updated 8 years ago
PacktPublishing / Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL
Hands-On Reinforcement Learning with TensorFlow & TRFL
☆14 · Jan 18, 2021 · Updated 5 years ago
LuEE-C / PPO-Keras
My implementation of the Proximal Policy Optimization algorithm using Keras as a backend
☆88 · Nov 15, 2019 · Updated 6 years ago
YRussac / WeightedLinearBandits
Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"
☆17 · Nov 14, 2019 · Updated 6 years ago
AnujMahajanOxf / VIREL
Code for VIREL: A Variational Inference Framework for Reinforcement Learning
☆14 · Dec 1, 2019 · Updated 6 years ago
DartML / PPO-Stein-Control-Variate
Proximal Policy Optimization with Stein Control Variates
☆33 · Feb 12, 2018 · Updated 8 years ago
CR-Gjx / RIA
TensorFlow implementation of "A Relational Intervention Approach for Unsupervised Dynamics Generalization in Model-Based Reinforcement Le…
☆16 · Jul 2, 2022 · Updated 4 years ago
sgiguere / RobinHood-NeurIPS-2019
Implementation of safe offline bandit algorithms.
☆10 · Oct 27, 2019 · Updated 6 years ago
jianing-sun / Interpolated-Policy-Gradient-with-PPO-for-Robotics-Control-
Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr…
☆38 · Feb 5, 2019 · Updated 7 years ago
hongzimao / a3c
TensorFlow implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
☆24 · Apr 20, 2017 · Updated 9 years ago
eyounx / PRR
Meta-Reinforcement Learning with Policy Residual Representation
☆11 · Aug 15, 2019 · Updated 6 years ago
bsivanantham / GAE
Reinforcement learning algorithms with Generalized Advantage Estimation
☆22 · Jun 6, 2018 · Updated 8 years ago
takuseno / mvc-drl
Cleanest deep reinforcement learning implementation based on Web MVC architecture with complete unit tests
☆12 · Jun 7, 2019 · Updated 7 years ago
YaoMarkMu / DOMINO_MB-MetaRL
☆20 · Apr 5, 2023 · Updated 3 years ago
Maximellerbach / Image-Processing-using-AI
In this repo, I create models to process images (upscaling, deblurring...)
☆15 · Oct 5, 2021 · Updated 4 years ago
InsaneMonster / telerl2021
GitHub for the article Deep Reinforcement Learning for URLLC data management on top of scheduled eMBB traffic (Fabio Saggese, Luca Pasqua…
☆18 · Feb 18, 2021 · Updated 5 years ago
ahq1993 / Multimodal-Deep-Q-Network-for-Social-Human-Robot-Interaction
Multimodal Deep Q-Network (MDQN) for modelling human-like social intelligence.
☆14 · Feb 23, 2017 · Updated 9 years ago
PaolaArdon / Salt-Pepper
Robotics Project
☆46 · Mar 12, 2019 · Updated 7 years ago
hiwonjoon / maml-tensorflow
This repository implements the paper "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks".
☆16 · Nov 3, 2017 · Updated 8 years ago
cair / rl
☆13 · Sep 15, 2021 · Updated 4 years ago
microsoft / bonsai-common
☆10 · Jul 20, 2023 · Updated 2 years ago
WILAB-HIT / News
☆11 · Mar 14, 2016 · Updated 10 years ago
akdev-tech / python-chatgpt
ChatGPT without browser emulation
☆13 · Dec 12, 2022 · Updated 3 years ago
Applied-Machine-Learning-Lab / Diff-MSR
Code for 'Diff-MSR: A Diffusion Model Enhanced Paradigm for Cold-Start Multi-Scenario Recommendation' accepted to WSDM 2024
☆14 · Aug 1, 2025 · Updated 11 months ago
lafmdp / HIDIL
[NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"
☆12 · Nov 24, 2021 · Updated 4 years ago
tianxusky / Code-for-Error-Bounds-of-Imitating-Policies-and-Environments
☆10 · Oct 15, 2020 · Updated 5 years ago
nric / ProximalPolicyOptimizationKeras
This is a deterministic TensorFlow 2.0 (Keras) implementation of OpenAI's proximal policy optimization (PPO) actor-critic algorithm.
☆12 · Sep 3, 2020 · Updated 5 years ago
colinpcurtis / datares_GANs
pix2pix and Cycle GAN architectures for image style transfer
☆13 · May 27, 2021 · Updated 5 years ago
attashe / ModifiedBeamSampler
Modified Beam Search with periodical restart
☆12 · Sep 12, 2024 · Updated last year
llq20133100095 / deep-tiaotiao
Using reinforcement learning to play the WeChat mini-game Jump Jump (跳一跳)
☆12 · Jul 10, 2022 · Updated 3 years ago
glampert / reverse-engineering-outlaws
An attempt to reverse engineer custom file formats used by the game Outlaws from LucasArts.
☆16 · Nov 3, 2018 · Updated 7 years ago
veronicachelu / meta-learning
Meta Reinforcement Learning Experiments
☆35 · Aug 22, 2017 · Updated 8 years ago
carpedm20 / a3c-tensorflow
☆32 · Apr 27, 2017 · Updated 9 years ago
Azure-Samples / aihlsignited-medindexer
Indexing framework designed for the automated creation of structured knowledge bases in Azure AI Search
☆15 · Jun 10, 2026 · Updated 3 weeks ago
cxxgtxy / POP3D
Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization
☆44 · Nov 8, 2018 · Updated 7 years ago
dreamness-dnalm / socket-transfer-file-and-message
Send and receive messages and files over Python 3 sockets
☆12 · May 24, 2018 · Updated 8 years ago
christophmark / bayesianfridge
Sequential Monte Carlo sampler for PyMC2 models.
☆14 · Apr 4, 2018 · Updated 8 years ago