young-geng / SimpleSAC
A simple and easy to use implementation of the soft actor-critic algorithm.
☆15Updated 2 years ago
Related projects: ⓘ
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆30Updated 4 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆22Updated 5 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆31Updated 3 years ago
- ☆26Updated 5 years ago
- Revisiting Rainbow☆73Updated 3 years ago
- My Body Is A Cage☆37Updated 3 years ago
- Change-Based Exploration Transfer☆35Updated 2 years ago
- ☆23Updated last year
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆21Updated 5 months ago
- ☆41Updated 5 years ago
- ☆23Updated last month
- ☆30Updated 10 months ago
- Implementation of the Prioritized Option-Critic on the Four-Rooms Environment☆15Updated 6 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆15Updated 2 years ago
- Implementation of CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning☆27Updated 4 years ago
- ☆14Updated 3 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆44Updated 3 years ago
- Efficient Exploration via State Marginal Matching (2019)☆66Updated 5 years ago
- on-policy optimization baselines for deep reinforcement learning☆28Updated 4 years ago
- ☆39Updated 3 years ago
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆17Updated last year
- 📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)☆24Updated 3 years ago
- Single Episode Policy Transfer in Reinforcement Learning☆17Updated 2 years ago
- E-MAML, and RL-MAML baseline implemented in Tensorflow v1☆15Updated 4 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆35Updated 2 weeks ago
- Official Codebase for Offline Reinforcement Learning from Images with Latent Space Models☆28Updated 3 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆21Updated 2 years ago
- Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.☆19Updated 5 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆37Updated 3 years ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆17Updated 3 years ago