Implement many Sparse Reward algorithms in Gym Fetch environment
☆90Jul 9, 2020Updated 5 years ago
Alternatives and similar repositories for Sparse-Reward-Algorithms
Users that are interested in Sparse-Reward-Algorithms are comparing it to the libraries listed below
Sorting:
- Modular-HER is revised from OpenAI baselines and supports many improvements for Hindsight Experience Replay as modules.☆17Jun 23, 2021Updated 4 years ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆270May 20, 2020Updated 5 years ago
- DRLib:a Concise Deep Reinforcement Learning Library, Integrating HER, PER and D2SR for Almost Off-Policy RL Algorithms.☆564Apr 2, 2024Updated last year
- Learning Individual Intrinsic Reward in MARL☆64Dec 8, 2022Updated 3 years ago
- Assignments for CS294-112.☆30Sep 11, 2019Updated 6 years ago
- Energy-Based Hindsight Experience Prioritization (CoRL 2018) Oral presentation (7%)☆35Nov 28, 2018Updated 7 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆37Dec 7, 2020Updated 5 years ago
- Reinforcement learning in 3D.☆21Mar 29, 2017Updated 8 years ago
- 2D Gridworld navigation using RL with Hindsight Experience Replay☆44May 29, 2019Updated 6 years ago
- Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.☆89Dec 26, 2018Updated 7 years ago
- Hindsight policy gradients☆46Jan 31, 2020Updated 6 years ago
- Curriculum-guided Hindsight Experience Replay (NeurIPS-2019)☆67Feb 14, 2020Updated 6 years ago
- Implementations of a large collection of reinforcement learning algorithms.☆28Nov 30, 2023Updated 2 years ago
- Code for generating options for planning and reinforcement learning☆12Feb 18, 2021Updated 5 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆24May 30, 2019Updated 6 years ago
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"☆14May 24, 2021Updated 4 years ago
- Project under CSF407 - AI☆13Jun 24, 2024Updated last year
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆51Dec 8, 2022Updated 3 years ago
- Code for SPIBB-DQN and Soft-SPIBB-DQN☆11May 5, 2020Updated 5 years ago
- ☆31Jul 1, 2019Updated 6 years ago
- Exploration by Random Network Distillation☆15Dec 30, 2018Updated 7 years ago
- The code of paper "Learning Heterogeneous Strategies via Graph-based Multi-agent Reinforcement Learning in Mixed Cooperative-Competitive …☆15Jul 17, 2021Updated 4 years ago
- Reinforcement Leanring Algorithms Trained with Unity☆13Apr 26, 2019Updated 6 years ago
- 利用python强大的可视化,加深对一些机器人运动规划算法的理解☆12Sep 28, 2019Updated 6 years ago
- G-HER algorithm☆18May 24, 2019Updated 6 years ago
- ☆14Oct 27, 2019Updated 6 years ago
- MSc Informatics dissertation project - University of Edinburgh: Curiosity in Multi-Agent Reinforcement Learning☆13Aug 16, 2019Updated 6 years ago
- Implementation of DDPG+HER on gym robotics environment FetchReach-v1☆33Nov 13, 2018Updated 7 years ago
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards☆32Jan 19, 2023Updated 3 years ago
- 算法工程师技术栈学习笔记☆15Aug 22, 2022Updated 3 years ago
- ☆18Mar 19, 2019Updated 6 years ago
- Implementations of DQN, DQN with PER, DDQN with PER, and DDDQN with PER agents to maximise reward in a 14 node power grid station☆20Aug 22, 2020Updated 5 years ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆26Aug 9, 2025Updated 6 months ago
- Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"☆23Feb 15, 2023Updated 3 years ago
- This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.☆442Dec 11, 2021Updated 4 years ago
- ☆18Jan 3, 2022Updated 4 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆24Apr 8, 2024Updated last year
- ☆17Oct 13, 2019Updated 6 years ago
- Reward Learning by Simulating the Past☆46May 9, 2019Updated 6 years ago