keon/policy-gradient

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/keon/policy-gradient)

keon / policy-gradient

Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras

☆160

Alternatives and similar repositories for policy-gradient

Users that are interested in policy-gradient are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

keon / deep-q-learning
View on GitHub
Minimal Deep Q Learning (DQN & DDQN) implementations in Keras
☆1,325May 18, 2026Updated 2 months ago
keon / deep-nlp
View on GitHub
[In-Progress] Mini implementations of deep learning algorithms for natural language processing in PyTorch
☆30Mar 30, 2017Updated 9 years ago
keon / seq2seq-wgan
View on GitHub
Improved Training of Wasserstein GANs for Neural Machine Translation
☆11Dec 11, 2017Updated 8 years ago
keon / CodeGAN
View on GitHub
[Deprecated] Source Code Generation using Sequence Generative Adversarial Networks
☆75Jan 7, 2017Updated 9 years ago
xlnwel / model-free-algorithms
View on GitHub
TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x
☆63Apr 5, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
gabrielgarza / openai-gym-policy-gradient
View on GitHub
Reinforcement Learning using Policy Gradient to solve OpenAI Gym games
☆112Dec 13, 2017Updated 8 years ago
keon / pytorch-exercises
View on GitHub
Pytorch exercises
☆98Mar 9, 2017Updated 9 years ago
kimhc6028 / policy-gradient-importance-sampling
View on GitHub
Policy gradient reinforcement learning algorithm with importance sampling
☆33Oct 6, 2017Updated 8 years ago
keon / text-wgan
View on GitHub
Improved Training of Wasserstein GANs for Text Generation
☆23Nov 26, 2017Updated 8 years ago
tokb23 / dqn
View on GitHub
DQN implementation in Keras + TensorFlow + OpenAI Gym
☆158Jan 23, 2018Updated 8 years ago
keon / deepsort
View on GitHub
Deep Learning the Sorting Algorithm
☆12Dec 11, 2016Updated 9 years ago
keon / deeptravel
View on GitHub
Solving Traveling Salesman Problem (TSP) using Deep Learning
☆34Dec 25, 2016Updated 9 years ago
yanpanlau / DDPG-Keras-Torcs
View on GitHub
Using Keras and Deep Deterministic Policy Gradient to play TORCS
☆727Dec 4, 2017Updated 8 years ago
jjkke88 / RL_toolbox
View on GitHub
reinfore learning tool box, contains trpo, a3c algorithm for continous action space
☆41Jan 27, 2018Updated 8 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
keon / Seq2Seq-Tensorflow
View on GitHub
[In-Progress] Tensorflow implementation of Sequence to Sequence Learning with Neural Networks
☆18Sep 8, 2016Updated 9 years ago
yrlu / reinforcement_learning
View on GitHub
Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.
☆153May 28, 2023Updated 3 years ago
LuEE-C / PPO-Keras
View on GitHub
My implementation of the Proximal Policy Optisation algorithm using Keras as a backend
☆88Nov 15, 2019Updated 6 years ago
zafarali / policy-gradient-methods
View on GitHub
Modular PyTorch implementation of policy gradient methods
☆24Nov 15, 2018Updated 7 years ago
rlcode / reinforcement-learning
View on GitHub
Minimal and Clean Reinforcement Learning Examples
☆3,657Jun 12, 2026Updated last month
uber-research / atari-model-zoo
View on GitHub
A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release th…
☆205May 25, 2020Updated 6 years ago
reinforcement-learning-kr / pg_travel
View on GitHub
Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)
☆371Aug 1, 2019Updated 6 years ago
Kaixhin / GUDRL
View on GitHub
Generalised UDRL
☆37May 12, 2022Updated 4 years ago
krasheninnikov / max-causal-ent-irl
View on GitHub
Maximum Causal Entropy Inverse Reinforcement Learning
☆49Nov 24, 2018Updated 7 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
pokaxpoka / netrand
View on GitHub
Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020
☆57Apr 27, 2020Updated 6 years ago
awjuliani / oreilly-rl-tutorial
View on GitHub
Contains Jupyter notebooks associated with the "Deep Reinforcement Learning Tutorial" tutorial given at the O'Reilly 2017 NYC AI Conferen…
☆276Jan 16, 2020Updated 6 years ago
hs105 / multi-agent-reinforcement-learning
View on GitHub
This maintains a reading list for multi-agent reinforcement learning
☆16Jun 8, 2017Updated 9 years ago
neka-nat / inv_rl
View on GitHub
Inverse Reinforcement Learning Argorithms
☆52May 13, 2019Updated 7 years ago
thundergolfer / text-classify-with-cnn
View on GitHub
Easy to follow text classifying implementation using a Conv. Neural Network (Tensorflow)
☆14Apr 22, 2017Updated 9 years ago
hashbangCoder / Neural-Conversational-Model
View on GitHub
Tensorflow Implementation of Neural Conversational Model by Vinyals et.al.
☆12Sep 3, 2016Updated 9 years ago
andyliu42 / Counterfactual_Regret_Minimization_Python
View on GitHub
Counterfactual Regret Minimization (CFR) sample code in Python
☆14Apr 16, 2019Updated 7 years ago
floringogianu / categorical-dqn
View on GitHub
A working implementation of the Categorical DQN (Distributional RL).
☆95Apr 7, 2018Updated 8 years ago
fbora / tic-tac-GO_ZERO
View on GitHub
Implementation of Alpha Go Zero algorithm for the game of tic-tac-toe
☆16Nov 4, 2017Updated 8 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
avinashu1980 / Intro_Python_Gurobi
View on GitHub
☆16May 11, 2017Updated 9 years ago
flowersteam / geppg
View on GitHub
☆36Aug 10, 2018Updated 7 years ago
hohoCode / cgx
View on GitHub
UltraFast GPU Grammar eXtractor for Machine Translation (He et al., TACL 2015 & NAACL 2013)
☆12Jun 19, 2015Updated 11 years ago
zhongwen / predictron
View on GitHub
Tensorflow implementation of "The Predictron: End-To-End Learning and Planning"
☆289Jan 20, 2017Updated 9 years ago
j-min / WikiExtractor_To_the_one_text
View on GitHub
Simple extension of WikiExtractor(https://github.com/attardi/wikiextractor)
☆16Dec 23, 2016Updated 9 years ago
saeta / tensorflow-workshop
View on GitHub
Slides and code from our TensorFlow workshop.
☆24Jun 26, 2017Updated 9 years ago
Alexander-H-Liu / Policy-Gradient-and-Actor-Critic-Keras
View on GitHub
Simple implementation of Policy Gradient (PG)/ Actor-Critic with keras
☆29Dec 20, 2017Updated 8 years ago