voot-t/guide-actor-critic

Keras implementation of guide actor-critic for continuous control

☆11

Alternatives and similar repositories for guide-actor-critic

Users interested in guide-actor-critic are comparing it to the repositories listed below.

vub-ai-lab / bdpi
Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration
☆25 · Sep 9, 2019 · Updated 6 years ago
sparisi / td-reg
TD-Regularized Actor-Critic Methods
☆36 · Dec 26, 2019 · Updated 6 years ago
illidanlab / rpg
Ranking Policy Gradient
☆23 · Nov 27, 2019 · Updated 6 years ago
boschresearch / DD_OPG
Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.
☆11 · Jun 12, 2019 · Updated 6 years ago
mossr / CrossEntropyVariants.jl
Cross-entropy method variants for optimization in Julia
☆12 · Apr 29, 2021 · Updated 4 years ago
seungyulhan / disc
☆10 · Aug 17, 2022 · Updated 3 years ago
mcgillmrl / robot_learning
ROS package for robot learning
☆17 · Oct 16, 2019 · Updated 6 years ago
oxwhirl / opiq
Code for Optimistic Exploration even with a Pessimistic Initialisation
☆14 · Aug 4, 2020 · Updated 5 years ago
jparkerholder / ASEBO
Code to run the ASEBO algorithm from the paper: From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization... …
☆16 · Oct 14, 2020 · Updated 5 years ago
RLAgent / state-marginal-matching
Efficient Exploration via State Marginal Matching (2019)
☆69 · Jun 30, 2019 · Updated 6 years ago
flowersteam / geppg
☆36 · Aug 10, 2018 · Updated 7 years ago
quanvuong / Supervised_Policy_Update
Code to reproduce Supervised Policy Update (ICLR 2019)
☆17 · Dec 8, 2022 · Updated 3 years ago
Breakend / ReproducibilityInContinuousPolicyGradientMethods
These are experiments for examining reproducibility in Policy Gradient RL algorithms in Continuous domains. Mainly using the Rllab implem…
☆17 · Sep 20, 2017 · Updated 8 years ago
philipjball / OffCon3
📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)
☆25 · Jun 20, 2021 · Updated 4 years ago
DavidJanz / successor_uncertainties_atari
Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…
☆21 · Feb 24, 2023 · Updated 3 years ago
Innixma / dex
Continual Learning Toolkit for Reinforcement Learning
☆21 · Jan 28, 2018 · Updated 8 years ago
ehknight / natural-gradient-deep-q-learning
☆23 · Oct 7, 2018 · Updated 7 years ago
facebookresearch / reward-estimator-corl
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
☆22 · Oct 26, 2018 · Updated 7 years ago
karush17 / esac
Evolution-based Soft Actor-Critic (ESAC)
☆42 · Jul 25, 2024 · Updated last year
behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24 · Jun 24, 2020 · Updated 5 years ago
mklissa / PPOC
Proximal Policy Option-Critic
☆26 · Jan 4, 2019 · Updated 7 years ago
deep-skill-chaining / deep-skill-chaining
Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"
☆30 · Sep 24, 2019 · Updated 6 years ago
MontenegroAlessandro / MagicRL
☆14 · Jan 24, 2026 · Updated last month
OliverRichter / map-reader
Tensorflow implementation of the map reading algorithm described in ‘Teaching a Machine to Read Maps with Deep Reinforcement Learning’
☆32 · Nov 14, 2017 · Updated 8 years ago
mcmachado / count_based_exploration_sr
☆31 · Jul 1, 2019 · Updated 6 years ago
pfnet-research / capg
Implementation of clipped action policy gradient (CAPG) with PPO and TRPO
☆31 · Jun 24, 2018 · Updated 7 years ago
rasoolfa / P3O
P3O paper code
☆30 · Aug 7, 2019 · Updated 6 years ago
i5lab / Platoon-Simulation
Compare Laguerre-based MPC and Traditional MPC for platoon of vehicles.
☆13 · Feb 14, 2023 · Updated 3 years ago
Silvicek / distributional-dqn
Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…
☆133 · May 5, 2019 · Updated 6 years ago
mcgillmrl / kusanagi
Library for model based RL in robotics
☆37 · Sep 10, 2018 · Updated 7 years ago
da-molchanov / variance-networks
Variance Networks: When Expectation Does Not Meet Your Expectations, ICLR 2019
☆39 · Jan 31, 2020 · Updated 6 years ago
michaelkoelle / marl-aquarium
Aquarium: A Comprehensive Framework for Exploring Predator-Prey Dynamics through Multi-Agent Reinforcement Learning Algorithms
☆13 · Apr 3, 2024 · Updated last year
amakadia / svd_for_pose
☆11 · Nov 3, 2020 · Updated 5 years ago
lns / dapo
Source code for the paper "Divergence-Augmented Policy Optimization"
☆37 · Nov 28, 2019 · Updated 6 years ago
aijunbai / markov-game
Stochastic Markov Games
☆12 · Oct 5, 2017 · Updated 8 years ago
thanh-isu / double-sparse-coding
Matlab code for learning doubly sparse dictionary on synthetic data. Details can be found in the paper "A Provable Approach for Double-Sp…
☆11 · Mar 5, 2018 · Updated 8 years ago
ertsiger / induction-subgoal-automata-rl
Code for the papers "Induction of Subgoal Automata for Reinforcement Learning" (AAAI-20) and "Induction and Exploitation of Subgoal Autom…
☆13 · Aug 15, 2023 · Updated 2 years ago
aldryd / imageswitcher
Example of how to use a ViewSwitcher to switch between two ImageView objects
☆13 · Dec 16, 2012 · Updated 13 years ago
flowersteam / teachDeepRL
☆91 · Jun 8, 2021 · Updated 4 years ago