cxxgtxy/POP3D

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/cxxgtxy/POP3D)

cxxgtxy / POP3D

Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization

☆44

Alternatives and similar repositories for POP3D

Users that are interested in POP3D are comparing it to the libraries listed below

Sorting:

quanvuong / Supervised_Policy_Update
View on GitHub
Code to reproduce Supervised Policy Update (ICLR 2019)
☆17Dec 8, 2022Updated 3 years ago
cjm715 / mgym
View on GitHub
A collection of multi-agent reinforcement learning OpenAI gym environments
☆46Jun 22, 2020Updated 5 years ago
voot-t / guide-actor-critic
View on GitHub
Keras implementation of guide actor-critic for continuous control
☆11Mar 12, 2018Updated 7 years ago
oswsnqc / Tensorflow-DPPO
View on GitHub
self implementation of DPPO, Distributed Proximal Policy Optimization, by using tensorflow
☆12Sep 1, 2017Updated 8 years ago
aary / sharp
View on GitHub
Some C++ libraries I implemented
☆15Feb 10, 2018Updated 8 years ago
AlexVeuthey / KinectFusion
View on GitHub
Master's semester project at EPFL: implement a depth map fusion algorithm for structured light.
☆11Jan 13, 2017Updated 9 years ago
clvrai / FeatureControlHRL-Tensorflow
View on GitHub
A Tensorflow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning
☆32Oct 12, 2017Updated 8 years ago
miyosuda / rodentia
View on GitHub
3D learning environment with rigid body simulation for Linux/MacOSX
☆14Dec 24, 2021Updated 4 years ago
Kaixhin / Easy21
View on GitHub
Reinforcement Learning Assignment: Easy21
☆12Jul 4, 2016Updated 9 years ago
Kaixhin / GUDRL
View on GitHub
Generalised UDRL
☆37May 12, 2022Updated 3 years ago
cxxgtxy / deeprl-baselines
View on GitHub
Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…
☆35Aug 23, 2018Updated 7 years ago
flyyufelix / sonic_contest
View on GitHub
Source code for OpenAI Retro Contest for Sonic the Hedgehog
☆31Aug 20, 2018Updated 7 years ago
gliese581gg / A3C_tensorflow
View on GitHub
Tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'
☆13Dec 23, 2016Updated 9 years ago
ShibiHe / Q-Optimality-Tightening
View on GitHub
This is my implementation of the Optimality Tightening
☆37Apr 26, 2017Updated 8 years ago
brentyi / toy-transformers-jax
View on GitHub
GPT implementation in Flax
☆18Jan 8, 2022Updated 4 years ago
bheijden / rex
View on GitHub
Rex is a JAX-powered framework for sim-to-real robotics.
☆52Jun 11, 2025Updated 8 months ago
apourchot / CEM-RL
View on GitHub
Combining Evolutionary Algorithms and deep RL in various ways
☆107Nov 17, 2020Updated 5 years ago
ppocma / ppocma
View on GitHub
☆73May 24, 2019Updated 6 years ago
jamolnng / OpenCL-CUDA-Tutorials
View on GitHub
Sources for OpenCL and CUDA tutorials. http://jlaning.com
☆20Jan 9, 2016Updated 10 years ago
KyriacosShiarli / taco
View on GitHub
☆25Jan 2, 2019Updated 7 years ago
chamorajg / pl-dreamer
View on GitHub
Simplistic Pytorch Implementation of the Dreamer-RL
☆20May 7, 2025Updated 10 months ago
flowersteam / Curiosity_Driven_Goal_Exploration
View on GitHub
Code to reproduce the results of "Curiosity Driven Exploration of Learned Disentangled Goal Spaces"
☆19Oct 26, 2018Updated 7 years ago
XiaoxiaoGuo / atari_uct
View on GitHub
Upper Confidence Tree Planner for ATARI games
☆19Mar 9, 2016Updated 10 years ago
njustesen / a2c_gvgai
View on GitHub
A2C for GVG-AI
☆23Nov 7, 2018Updated 7 years ago
inoryy / Deep-RL-Bootcamp-Labs
View on GitHub
Solutions to the Deep RL Bootcamp labs
☆42Oct 15, 2017Updated 8 years ago
paintception / Deep-Quality-Value-DQV-Learning-
View on GitHub
DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm
☆24Feb 15, 2023Updated 3 years ago
wwxFromTju / sc2-101-zh
View on GitHub
just for fun
☆23Sep 10, 2017Updated 8 years ago
avdmitry / rl_3d
View on GitHub
Reinforcement learning in 3D.
☆21Mar 29, 2017Updated 8 years ago
illidanlab / rpg
View on GitHub
Ranking Policy Gradient
☆23Nov 27, 2019Updated 6 years ago
tesatory / hsp
View on GitHub
Hierarchical Self-Play
☆21Dec 5, 2018Updated 7 years ago
openai / phasic-policy-gradient
View on GitHub
Code for the paper "Phasic Policy Gradient"
☆268Apr 2, 2023Updated 2 years ago
DartEnv / gym-dart
View on GitHub
OpenAI Gym environment for DART robotics simulator.
☆22Apr 17, 2018Updated 7 years ago
musyoku / gqn-dataset-renderer
View on GitHub
☆26Jul 19, 2019Updated 6 years ago
tedmoskovitz / TOP
View on GitHub
Implementation of Tactical Optimistic and Pessimistic value estimation
☆25Jul 18, 2023Updated 2 years ago
bkj / pbt
View on GitHub
Population Based Training, Figure 2
☆25Dec 2, 2017Updated 8 years ago
YuhangSong / DEHRL
View on GitHub
Diversity−Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019.
☆49Feb 23, 2019Updated 7 years ago
behaviorguidedRL / BGRL
View on GitHub
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24Jun 24, 2020Updated 5 years ago
epignatelli / discovering-reinforcement-learning-algorithms
View on GitHub
A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…
☆23Dec 22, 2020Updated 5 years ago
rubenrtorrado / GVGAI_GYM
View on GitHub
☆107Jan 22, 2020Updated 6 years ago