mklissa/PPOC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mklissa/PPOC)

mklissa / PPOC

Proximal Policy Option-Critic

☆26

Alternatives and similar repositories for PPOC

Users that are interested in PPOC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

arushijain94 / SafeOptionCritic
View on GitHub
Safe Option-Critic: Learning Safety in the Option-Critic Architecture
☆21Dec 16, 2018Updated 7 years ago
seungyulhan / disc
View on GitHub
☆10Aug 17, 2022Updated 3 years ago
jeanharb / a2oc_delib
View on GitHub
A3C style Option-Critic with deliberation cost
☆40Jan 9, 2018Updated 8 years ago
tdavchev / option-critic
View on GitHub
A Tensorflow implementation of the Option-Critic Architecture
☆75Jun 1, 2017Updated 9 years ago
alversafa / option-critic-arch
View on GitHub
Implementation of the Option-Critic Architecture
☆42Dec 9, 2018Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
robintyh1 / icml2021-pengqlambda
View on GitHub
Revisiting Peng's Q(lambda) for Modern Reinforcement Learning
☆15Jul 23, 2021Updated 4 years ago
kngwyu / Rainy
View on GitHub
Deep RL agents with PyTorch
☆35Sep 25, 2021Updated 4 years ago
deep-skill-chaining / deep-skill-chaining
View on GitHub
Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"
☆30Sep 24, 2019Updated 6 years ago
mila-iqia / teamgrid
View on GitHub
Multiagent gridworld for the TEAM project based on gym-minigrid
☆12Nov 27, 2019Updated 6 years ago
lweitkamp / option-critic-pytorch
View on GitHub
PyTorch implementation of the Option-Critic framework, Harb et al. 2016
☆146Aug 2, 2024Updated last year
jeanharb / option_critic
View on GitHub
Implementation of the Option-Critic Architecture on the Atari (ALE) environment
☆183Sep 21, 2017Updated 8 years ago
Steven-Ho / VALOR
View on GitHub
Implementation of VALOR (Variational Option Discovery Algorithms)
☆10Jun 28, 2019Updated 7 years ago
hzm2016 / option-critic-pytorch
View on GitHub
☆15Nov 21, 2022Updated 3 years ago
nnaisense / MAGE
View on GitHub
Learning Action-Value Gradients in Model-based Policy Optimization
☆32Sep 7, 2021Updated 4 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
rll-research / cic
View on GitHub
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
☆88Jul 27, 2022Updated 3 years ago
Mehooz / awesome-long-horizon-goal-reaching
View on GitHub
Personal reading list for learning-based long-horizon goal reaching methods
☆17Nov 26, 2020Updated 5 years ago
DavidJanz / successor_uncertainties_atari
View on GitHub
Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…
☆21Feb 24, 2023Updated 3 years ago
veronicachelu / temporal_abstraction
View on GitHub
Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…
☆24Nov 29, 2018Updated 7 years ago
zhougroup / IDAC
View on GitHub
Implicit Distributional Actor Critic
☆11Dec 8, 2021Updated 4 years ago
ben-eysenbach / sac
View on GitHub
Soft Actor-Critic
☆160Mar 13, 2018Updated 8 years ago
jinnaiyuu / Optimal-Options-ICML-2019
View on GitHub
Code for generating options for planning and reinforcement learning
☆12Feb 18, 2021Updated 5 years ago
mcmachado / count_based_exploration_sr
View on GitHub
☆31Jul 1, 2019Updated 7 years ago
boschresearch / DD_OPG
View on GitHub
Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.
☆11Jun 12, 2019Updated 7 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
entity-neural-network / incubator
View on GitHub
Collection of in-progress libraries for entity neural networks.
☆29Jun 24, 2022Updated 4 years ago
joeybose / FloRL
View on GitHub
Implicit Normalizing Flows + Reinforcement Learning
☆62May 31, 2019Updated 7 years ago
neitzal / adaptive-skip-intervals
View on GitHub
Implementation of the paper "Adaptive Skip Intervals: Temporal Abstraction for Recurrent Dynamical Models"
☆24Sep 7, 2018Updated 7 years ago
junsu-kim97 / HIGL
View on GitHub
PyTorch code accompanying the paper "Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning" (NeurIPS 2021).
☆32Oct 27, 2021Updated 4 years ago
voot-t / guide-actor-critic
View on GitHub
Keras implementation of guide actor-critic for continuous control
☆11Mar 12, 2018Updated 8 years ago
aluscher / torchbeastpopart
View on GitHub
Deep Learning Project
☆23Jan 18, 2020Updated 6 years ago
supratikp / HOOF
View on GitHub
Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583
☆19Oct 22, 2019Updated 6 years ago
iral-lab / gold
View on GitHub
Multimodal grounded language dataset
☆11Dec 14, 2021Updated 4 years ago
IouJenLiu / HTS-RL
View on GitHub
☆21Dec 22, 2020Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
montrealrobotics / domain-randomizer
View on GitHub
A standalone library to randomize various OpenAI Gym Environments
☆66Sep 29, 2019Updated 6 years ago
bhairavmehta95 / data-efficient-hrl
View on GitHub
Implementation of Data Efficient Reinforcement Learning in Pytorch
☆20Aug 6, 2019Updated 6 years ago
jqueeney / geppo
View on GitHub
Generalized Proximal Policy Optimization with Sample Reuse (GePPO)
☆29Jul 24, 2023Updated 2 years ago
oxwhirl / opiq
View on GitHub
Code for Optimistic Exploration even with a Pessimistic Initialisation
☆14Aug 4, 2020Updated 5 years ago
XiaoxiaoGuo / atari_uct
View on GitHub
Upper Confidence Tree Planner for ATARI games
☆19Mar 9, 2016Updated 10 years ago
Santara / stochastic_value_gradient
View on GitHub
Implementation of (Learning Continuous Control Policies by Stochastic Value Gradients)[https://arxiv.org/abs/1510.09142]
☆25Jan 15, 2022Updated 4 years ago
spitis / mrl
View on GitHub
☆119Apr 28, 2023Updated 3 years ago