hu-po / pySACQ
PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments)
☆38Feb 13, 2021Updated 5 years ago
Alternatives and similar repositories for pySACQ
Users that are interested in pySACQ are comparing it to the libraries listed below
- Contains the code for "BaRC: Backward Reachability Curriculum for Robotic Reinforcement Learning" by Boris Ivanovic, James Harrison, Apoo…☆12Jun 20, 2018Updated 7 years ago
- (Personal experiment) Unsupervised Predictive Memory in a Goal-Directed Agent https://arxiv.org/abs/1803.10760☆24May 3, 2019Updated 6 years ago
- ☆26Mar 16, 2023Updated 2 years ago
- PyTorch implementation of DARLA preprocessing models☆11Jan 30, 2018Updated 8 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆30Mar 14, 2019Updated 6 years ago
- MuJoCo models for Unitree Robots☆12Nov 24, 2021Updated 4 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆24May 30, 2019Updated 6 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- UCT tree search + supervised learning for Atari games☆12Feb 14, 2017Updated 9 years ago
- Model-based Policy Gradients☆32Mar 12, 2020Updated 5 years ago
- Variational Inference by Policy Search☆13Apr 24, 2019Updated 6 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆16Nov 14, 2018Updated 7 years ago
- Implementation of the POIS algorithm☆15Apr 9, 2019Updated 6 years ago
- ☆99Mar 24, 2023Updated 2 years ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithms. Includes a useful experiment framework for Me…☆246Sep 30, 2022Updated 3 years ago
- Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Source Code☆17Aug 23, 2024Updated last year
- Soft Q-learning and soft actor-critic☆16Dec 23, 2018Updated 7 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning☆21Mar 6, 2017Updated 8 years ago
- Implementation of "Training Agents using Upside-Down Reinforcement Learning (https://arxiv.org/pdf/1912.02877.pdf)"☆17Dec 17, 2019Updated 6 years ago
- Code Released for NeurIPS 2018 paper: Synthesized Policies for Transfer and Adaptation across Tasks and Environments☆16Apr 17, 2019Updated 6 years ago
- Episodic Control☆21Sep 20, 2022Updated 3 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- A pytorch implementation of Amortized Stein Variational Gradient Descent/ Stein GAN☆18Dec 13, 2018Updated 7 years ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".☆24Feb 9, 2023Updated 3 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Sep 13, 2019Updated 6 years ago
- Clockwork VAEs in JAX/Flax☆32Jul 16, 2021Updated 4 years ago
- Source code for paper Conservative Uncertainty Estimation By Fitting Prior Networks (ICLR 2020)☆22Nov 28, 2022Updated 3 years ago
- Stein Variational Policy Gradient for REINFORCE☆18Jul 12, 2017Updated 8 years ago
- In progress: state-of-the-art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in PyTorch.☆19Jun 15, 2018Updated 7 years ago
- ☆22Nov 8, 2021Updated 4 years ago
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future☆50Jun 6, 2019Updated 6 years ago
- Pytorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).☆21Jan 15, 2020Updated 6 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Jan 16, 2019Updated 7 years ago
- Convert DeepMind Control Suite to OpenAI gym environments.☆87Jan 31, 2020Updated 6 years ago
- Random Network Distillation (RND) algorithm in PyTorch☆51Feb 26, 2019Updated 6 years ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆184Mar 25, 2018Updated 7 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- A framework for creating your own reinforcement learning environments using pybullet☆21Oct 7, 2019Updated 6 years ago
- PyTorch implementation of (Deep) Reinforcement Learning (RL) algorithms☆25Jun 26, 2022Updated 3 years ago