alirezakazemipour / SAC
Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.
☆20Updated last month
Related projects: ⓘ
- ☆39Updated 2 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆70Updated 9 months ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆87Updated 3 years ago
- The implementation of LSTM-TD3.☆60Updated last year
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆75Updated 9 months ago
- This is the official repository for the paper "Guided Exploration with Proximal Policy Optimization using a Single Demonstration", https:…☆17Updated 2 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) on Pyramid env, Unity ML☆13Updated 9 months ago
- DSAC; Distributional Soft Actor-Critic☆108Updated 6 months ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆61Updated last year
- Author's PyTorch implementation of TD7 for online and offline RL☆108Updated last year
- PyTorch implementation of DDPG for continuous control tasks.☆41Updated 4 years ago
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"☆13Updated 3 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆41Updated 2 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆146Updated last year
- A DRL implementation repo☆19Updated this week
- 这是一个关于基于模型的强化学习的资料,包括一些代码地址、paper、slide等。☆34Updated 4 years ago
- Implementation of PPO Lagrangian in PyTorch☆34Updated 2 years ago
- PyTorch implementation of the Q-Learning Algorithm Normalized Advantage Function for continuous control problems + PER and N-step Method☆26Updated 3 years ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆60Updated 10 months ago
- Pytorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.]☆66Updated 5 years ago
- Implementation of centralized training, centralized execution of Soft Actor-Critic (SAC) on a Tennis multiagent Unity environment.☆30Updated 3 years ago
- Distributional Soft Actor Critic☆49Updated 4 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆44Updated 3 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆34Updated 4 years ago
- [ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control☆99Updated 3 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆150Updated 2 years ago
- Transformer in RL for decision-making☆71Updated last year
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆113Updated last month
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment☆56Updated this week
- Implementation of Off Policy Adversarial Inverse Reinforcement Learning☆20Updated 3 years ago