PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments)
☆38 · Feb 13, 2021 · Updated 5 years ago
Alternatives and similar repositories for pySACQ
Users that are interested in pySACQ are comparing it to the libraries listed below.
- (Personal experiment) Unsupervised Predictive Memory in a Goal-Directed Agent https://arxiv.org/abs/1803.10760 ☆24 · May 3, 2019 · Updated 7 years ago
- Official PyTorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration". ☆24 · Feb 9, 2023 · Updated 3 years ago
- Contains the code for "BaRC: Backward Reachability Curriculum for Robotic Reinforcement Learning" by Boris Ivanovic, James Harrison, Apoo… ☆12 · Jun 20, 2018 · Updated 7 years ago
- Soft Q-learning and soft actor-critic ☆16 · Dec 23, 2018 · Updated 7 years ago
- Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Source Code ☆17 · Aug 23, 2024 · Updated last year
- Code for "SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning" AAAI 2020 ☆26 · Dec 8, 2022 · Updated 3 years ago
- A framework for creating your own reinforcement learning environments using pybullet ☆21 · Oct 7, 2019 · Updated 6 years ago
- PyRL - Reinforcement Learning Framework in PyTorch (Policy Gradient, DQN, DDPG, TD3, PPO, SAC, etc.) ☆34 · Jun 22, 2022 · Updated 3 years ago
- ☆26 · Mar 16, 2023 · Updated 3 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019) ☆24 · May 30, 2019 · Updated 6 years ago
- Implementation of "Training Agents using Upside-Down Reinforcement Learning (https://arxiv.org/pdf/1912.02877.pdf)" ☆17 · Dec 17, 2019 · Updated 6 years ago
- Variational Inference by Policy Search ☆13 · Apr 24, 2019 · Updated 7 years ago
- UCT tree search + supervised learning for Atari games ☆12 · Feb 14, 2017 · Updated 9 years ago
- Authors' implementation of PEER ☆11 · Jul 13, 2023 · Updated 2 years ago
- Model-based Policy Gradients ☆32 · Mar 12, 2020 · Updated 6 years ago
- Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023 ☆57 · Feb 3, 2023 · Updated 3 years ago
- MuJoCo models for Unitree Robots ☆12 · Nov 24, 2021 · Updated 4 years ago
- RLBench simulation project for autonomous bin picking using Pandas robot arm ☆10 · Mar 1, 2021 · Updated 5 years ago
- Source code for paper Conservative Uncertainty Estimation By Fitting Prior Networks (ICLR 2020) ☆22 · Nov 28, 2022 · Updated 3 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning. ☆51 · Jan 16, 2019 · Updated 7 years ago
- Actor Prioritized Experience Replay ☆19 · Nov 20, 2023 · Updated 2 years ago
- [CVPR 2026] ☆75 · May 20, 2026 · Updated last week
- Code for the NeurIPS 2019 submission: "Improving Black-box Adversarial Attacks with a Transfer-based Prior". ☆15 · May 6, 2020 · Updated 6 years ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithm. Includes a useful experiment framework for Me… ☆248 · Sep 30, 2022 · Updated 3 years ago
- ☆23 · Jun 8, 2021 · Updated 4 years ago
- ☆14 · Sep 27, 2019 · Updated 6 years ago
- Using deep learning and reinforcement learning in an Enigma Catalyst algorithm. ☆11 · Jul 13, 2019 · Updated 6 years ago
- Deep RL for portfolio management ☆13 · Aug 31, 2018 · Updated 7 years ago
- Code implementing the CORE-RL algorithm with DDPG, PPO, and TRPO. See the paper "Control Regularization for Reduced Variance Reinforcemen… ☆32 · Jan 7, 2021 · Updated 5 years ago
- PyTorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ). ☆21 · Jan 15, 2020 · Updated 6 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning ☆30 · Mar 14, 2019 · Updated 7 years ago
- PyTorch Implementations of Augmented Random Search ☆17 · Feb 28, 2019 · Updated 7 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization ☆32 · Sep 7, 2021 · Updated 4 years ago
- ☆99 · Mar 24, 2023 · Updated 3 years ago
- A Python implementation of tile coding using NumPy. ☆11 · May 13, 2017 · Updated 9 years ago
- Interactive Text2Pickup Network for Natural Language based Human-Robot Collaboration ☆11 · Sep 28, 2018 · Updated 7 years ago
- Code for generating options for planning and reinforcement learning ☆12 · Feb 18, 2021 · Updated 5 years ago
- Exploring by Minimizing Uncertainty of Q values (EMU-Q) as presented in "Bayesian RL for Goal-Only Rewards" at CoRL'18. ☆10 · Nov 8, 2018 · Updated 7 years ago
- ☆12 · May 27, 2019 · Updated 6 years ago