hu-po/pySACQ

PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments)

☆39

Alternatives and similar repositories for pySACQ

Users that are interested in pySACQ are comparing it to the libraries listed below.

yosider / merlin
(Personal experiment) Unsupervised Predictive Memory in a Goal-Directed Agent https://arxiv.org/abs/1803.10760
☆25 · May 3, 2019 · Updated 7 years ago
StanfordASL / BaRC
Contains the code for "BaRC: Backward Reachability Curriculum for Robotic Reinforcement Learning" by Boris Ivanovic, James Harrison, Apoo…
☆12 · Jun 20, 2018 · Updated 8 years ago
zdhNarsil / Stochastic-Marginal-Actor-Critic
Official PyTorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".
☆24 · Feb 9, 2023 · Updated 3 years ago
LihaoR / Entropy-Regularized-RL
Soft Q-learning and soft actor-critic
☆16 · Dec 23, 2018 · Updated 7 years ago
chaovven / SMIX
Code for "SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning" AAAI 2020
☆26 · Dec 8, 2022 · Updated 3 years ago
utiasSTARS / lfgp
Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Source Code
☆17 · Aug 23, 2024 · Updated last year
chaovven / PyRL
PyRL - Reinforcement Learning Framework in PyTorch (Policy Gradient, DQN, DDPG, TD3, PPO, SAC, etc.)
☆34 · Jun 22, 2022 · Updated 4 years ago
Ji4chenLi / Multi-Task-Batch-RL
☆26 · Mar 16, 2023 · Updated 3 years ago
ruizhaogit / mep
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)
☆24 · May 30, 2019 · Updated 7 years ago
parthchadha / upsideDownRL
Implementation of "Training Agents using Upside-Down Reinforcement Learning" (https://arxiv.org/pdf/1912.02877.pdf)
☆17 · Dec 17, 2019 · Updated 6 years ago
OlegArenz / VIPS
Variational Inference by Policy Search
☆13 · Apr 24, 2019 · Updated 7 years ago
5vision / uct_atari
UCT tree search + supervised learning for Atari games
☆12 · Feb 14, 2017 · Updated 9 years ago
sweetice / PEER-CVPR23
Authors' implementation of PEER
☆11 · Jul 13, 2023 · Updated 3 years ago
DrifterFun / Geometric-Consistency-Model
Synthetic Camera Simulator - Unreal Engine4 Plugin
☆10 · Nov 2, 2019 · Updated 6 years ago
MahanFathi / Model-Based-RL
Model-based Policy Gradients
☆32 · Mar 12, 2020 · Updated 6 years ago
tinkoff-ai / sac-rnd
Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
☆58 · Feb 3, 2023 · Updated 3 years ago
vikashplus / unitree_sim
MuJoCo models for Unitree Robots
☆12 · Nov 24, 2021 · Updated 4 years ago
google-research / policy-learning-landscape
Explore the optimization landscape for direct policy learning reinforcement learning.
☆51 · Jan 16, 2019 · Updated 7 years ago
theogruner / rl_pro_telu
☆23 · Jun 8, 2021 · Updated 5 years ago
microsoft / conservative-uncertainty-estimation-random-priors
Source code for the paper "Conservative Uncertainty Estimation By Fitting Prior Networks" (ICLR 2020)
☆22 · Nov 28, 2022 · Updated 3 years ago
jonasrothfuss / ProMP
Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithms. Includes a useful experiment framework for Me…
☆247 · Sep 30, 2022 · Updated 3 years ago
murtazarang / MD-MADDPG
☆14 · Sep 27, 2019 · Updated 6 years ago
felixenzogarofalo / Deep-Learning-in-Catalyst
Using deep learning and reinforcement learning in an Enigma Catalyst algorithm.
☆11 · Jul 13, 2019 · Updated 7 years ago
yakovmon / Real-Time-Audio-Visual-Speech-Enhancement
☆13 · May 27, 2019 · Updated 7 years ago
jjakimoto / PPO-Pytorch
Deep RL for portfolio management
☆13 · Aug 31, 2018 · Updated 7 years ago
rcheng805 / CORE-RL
Code implementing the CORE-RL algorithm with DDPG, PPO, and TRPO. See the paper "Control Regularization for Reduced Variance Reinforcemen…
☆32 · Jan 7, 2021 · Updated 5 years ago
aletcher / stable-opponent-shaping
PyTorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).
☆21 · Jan 15, 2020 · Updated 6 years ago
ascane / gym-gazebo-hsr
An OpenAI Gym environment based on Gazebo and ROS for Human Support Robot (HSR)
☆16 · Sep 6, 2018 · Updated 7 years ago
jvmncs / ParamNoise
A comparison of parameter space noise methods for exploration in deep reinforcement learning
☆30 · Mar 14, 2019 · Updated 7 years ago
viig99 / mkscancer
Keras deep learning 1D ResNet solution for the https://www.kaggle.com/c/msk-redefining-cancer-treatment challenge
☆12 · Nov 10, 2017 · Updated 8 years ago
nickswalker / gpsr-command-understanding
Tools for understanding natural language robot commands
☆12 · Feb 21, 2021 · Updated 5 years ago
lasgroup / aceirl
Implementation of "Active Exploration for Inverse Reinforcement Learning" (AceIRL), NeurIPS 2022.
☆14 · Oct 12, 2022 · Updated 3 years ago
nnaisense / MAGE
Learning Action-Value Gradients in Model-based Policy Optimization
☆32 · Sep 7, 2021 · Updated 4 years ago
WilsonWangTHU / POPLIN
☆99 · Mar 24, 2023 · Updated 3 years ago
gehring / tilecoding
A Python implementation of tile coding using NumPy.
☆11 · May 13, 2017 · Updated 9 years ago
sarlinpe / Concrete-Dropout
A clean TensorFlow implementation of Concrete Dropout
☆22 · Jan 16, 2018 · Updated 8 years ago
tianheyu927 / mil
Code for "One-Shot Visual Imitation Learning via Meta-Learning"
☆290 · Oct 8, 2018 · Updated 7 years ago
hiddenmaze / InteractivePickup
Interactive Text2Pickup Network for Natural Language based Human-Robot Collaboration
☆11 · Sep 28, 2018 · Updated 7 years ago
PhilippeMorere / EMU-Q
Exploring by Minimizing Uncertainty of Q values (EMU-Q) as presented in "Bayesian RL for Goal-Only Rewards" at CoRL'18.
☆10 · Nov 8, 2018 · Updated 7 years ago