Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration
☆25Sep 9, 2019Updated 6 years ago
Alternatives and similar repositories for bdpi
Users that are interested in bdpi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Keras implementation of guide actor-critic for continuous control☆11Mar 12, 2018Updated 8 years ago
- ☆14Jun 21, 2024Updated last year
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Efficient Exploration via State Marginal Matching (2019)☆69Jun 30, 2019Updated 6 years ago
- Implementation of clipped action policy gradient (CAPG) with PPO and TRPO