quanvuong/Supervised_Policy_Update

Code to reproduce Supervised Policy Update (ICLR 2019)

☆17

Alternatives and similar repositories for Supervised_Policy_Update

Users who are interested in Supervised_Policy_Update are comparing it to the repositories listed below.

tgangwani / SelfImitationDiverse
Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)
☆20 · Nov 26, 2020 · Updated 5 years ago

voot-t / guide-actor-critic
Keras implementation of guide actor-critic for continuous control
☆11 · Mar 12, 2018 · Updated 8 years ago

illidanlab / rpg
Ranking Policy Gradient
☆23 · Nov 27, 2019 · Updated 6 years ago

seungyulhan / disc
☆10 · Aug 17, 2022 · Updated 3 years ago

flowersteam / rl-difference-testing
Simple tools for statistical analyses in RL experiments
☆67 · Jun 21, 2018 · Updated 8 years ago

ryanxhr / BEAR
Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"
☆11 · Oct 29, 2019 · Updated 6 years ago

facebookresearch / slbo
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆94 · Sep 13, 2019 · Updated 6 years ago

ryoungj / ZO-L2L
[ICLR'20] Learning to Learn by Zeroth-Order Oracle
☆14 · Feb 7, 2020 · Updated 6 years ago

cxxgtxy / POP3D
Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization
☆44 · Nov 8, 2018 · Updated 7 years ago

floringogianu / categorical-dqn
A working implementation of the Categorical DQN (Distributional RL).
☆95 · Apr 7, 2018 · Updated 8 years ago

Scitator / Run-Skeleton-Run
Reason8.ai PyTorch solution for NIPS RL 2017 challenge
☆84 · Oct 15, 2019 · Updated 6 years ago

jvmncs / ParamNoise
A comparison of parameter space noise methods for exploration in deep reinforcement learning
☆30 · Mar 14, 2019 · Updated 7 years ago

Wenxuan-Zhou / EPI
Code for Environment Probing Interaction Policies [ICLR 2019]
☆30 · Jun 17, 2019 · Updated 7 years ago

xkianteb / dril
Disagreement-Regularized Imitation Learning
☆30 · May 25, 2021 · Updated 5 years ago

roosephu / slbo
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆55 · Jul 26, 2019 · Updated 6 years ago

uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32 · Feb 21, 2020 · Updated 6 years ago

wensun / Imitation-Learning-from-Observation
☆24 · Jul 6, 2023 · Updated 3 years ago

KyriacosShiarli / taco
☆25 · Jan 2, 2019 · Updated 7 years ago

teslacool / m-curl
M-CURL: Masked Contrastive Representation Learning for Reinforcement Learning
☆29 · Nov 5, 2020 · Updated 5 years ago

HumanCompatibleAI / rlsp
Reward Learning by Simulating the Past
☆46 · May 9, 2019 · Updated 7 years ago

Santara / RAIL
Codebase of Santara et al., RAIL: Risk Averse Imitation Learning, published in AAMAS 2018
☆15 · Jan 15, 2022 · Updated 4 years ago

Hwhitetooth / lirpg
☆63 · Jun 22, 2018 · Updated 8 years ago

junhyukoh / self-imitation-learning
ICML 2018 Self-Imitation Learning
☆277 · Apr 18, 2020 · Updated 6 years ago

dyne-submission / dynamics-aware-embeddings
☆16 · Sep 25, 2019 · Updated 6 years ago

wulfebw / async_rl
Python implementation of tabular asynchronous actor critic
☆11 · May 3, 2016 · Updated 10 years ago

ZachisGit / LearningFromHumanPreferences
Learning From Human Preferences - Tensorflow+Keras Implementation
☆18 · Aug 17, 2017 · Updated 8 years ago

Kaixhin / Dist-A3C
Distributed A3C
☆34 · Dec 22, 2017 · Updated 8 years ago

idlrl / flare
RL framework for embodied agents based on PyTorch
☆11 · Apr 11, 2019 · Updated 7 years ago

oswsnqc / Tensorflow-DPPO
Self-implementation of DPPO (Distributed Proximal Policy Optimization) using TensorFlow
☆12 · Sep 1, 2017 · Updated 8 years ago

atabakd / MuJoCo-Tutorials
My understanding of Russ Tedrake's course on underactuated robotics, implemented in MuJoCo
☆45 · Oct 23, 2019 · Updated 6 years ago

AutumnWu / Streamlined-Off-Policy-Learning
ICRL 2020
☆20 · Feb 18, 2020 · Updated 6 years ago

behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24 · Jun 24, 2020 · Updated 6 years ago

VincentYu68 / SymmetryCurriculumLocomotion
☆53 · Jan 31, 2019 · Updated 7 years ago

YyzHarry / SV-RL
[ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning
☆34 · Feb 1, 2020 · Updated 6 years ago

s-kajita / BasicGaitGen
Matlab code for basic gait generator for students
☆10 · Sep 25, 2020 · Updated 5 years ago

Games-and-Simulations / StarAlgo
A squad movement planning library for StarCraft AI using Monte Carlo Tree Search and Negamax
☆14 · Jan 1, 2019 · Updated 7 years ago

hyparxis / gym-cassie
An OpenAI Gym style reinforcement learning interface for Agility Robotics' biped robot Cassie
☆41 · Apr 23, 2019 · Updated 7 years ago

david-abel / rl_info_theory
A collection of code investigating the use of information theory for abstractions in RL
☆16 · Nov 14, 2018 · Updated 7 years ago

agakshat / spacefortress
OpenAI Gym compatible reinforcement learning environment for Space Fortress https://arxiv.org/abs/1809.02206
☆11 · Aug 30, 2024 · Updated last year