Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration
☆25Sep 9, 2019Updated 6 years ago
Alternatives and similar repositories for bdpi
Users that are interested in bdpi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Keras implementation of guide actor-critic for continuous control☆11Mar 12, 2018Updated 8 years ago
- This Is Indian Country - Spring 2018 Instance☆12Apr 30, 2018Updated 8 years ago
- ☆14Jun 21, 2024Updated last year
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Efficient Exploration via State Marginal Matching (2019)☆69Jun 30, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of clipped action policy gradient (CAPG) with PPO and TRPO☆31Jun 24, 2018Updated 7 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆30Sep 24, 2019Updated 6 years ago
- TD-Regularized Actor-Critic Methods☆36Dec 26, 2019Updated 6 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- Wasserstein Distance guided Adversarial Imitation Learning (WDAIL) with Reward Shape Exploration☆19Feb 9, 2021Updated 5 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Feb 21, 2020Updated 6 years ago
- Master thesis work: explaining deep reinforcement learning policies☆10Aug 27, 2020Updated 5 years ago
- ROS package for robot learning☆17Oct 16, 2019Updated 6 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Sep 13, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Combining deep learning and reinforcement learning.☆81Apr 22, 2026Updated last month
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆30Mar 14, 2019Updated 7 years ago
- Gym implementation of connector to Deepmind lab☆12Mar 26, 2019Updated 7 years ago
- ☆15Oct 20, 2020Updated 5 years ago
- stack based virtual machine interpreter and a C compiler☆12May 9, 2025Updated last year
- Implementation of advantage-weighted regression.☆209May 30, 2020Updated 5 years ago
- ☆12Sep 18, 2025Updated 8 months ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Aug 4, 2020Updated 5 years ago
- ☆15Sep 22, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)☆70Aug 11, 2023Updated 2 years ago
- Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research☆15May 30, 2024Updated last year
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…☆12Mar 17, 2021Updated 5 years ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- Robustness via Retrying: Closed-Loop Robotic Manipulation with Self-Supervised Learning☆16Nov 7, 2018Updated 7 years ago
- Continual Learning Toolkit for Reinforcement Learning☆21Jan 28, 2018Updated 8 years ago
- Author's PyTorch implementation of paper "Provably Good Batch Reinforcement Learning Without Great Exploration"☆11Oct 22, 2020Updated 5 years ago
- PyTorch implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"☆10Nov 22, 2019Updated 6 years ago
- 3D learning environment with rigid body simulation for Linux/MacOSX☆14Dec 24, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆29Nov 21, 2022Updated 3 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Nov 28, 2019Updated 6 years ago
- Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020☆56Apr 27, 2020Updated 6 years ago
- Octax: Accelerated CHIP-8 Arcade Environments for JAX☆54Apr 20, 2026Updated last month
- reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆41Jan 27, 2018Updated 8 years ago
- Code for "Learning 6-DoF Grasping and Pick-Place Using Attention Focus"☆22Sep 21, 2018Updated 7 years ago
- This is my implementation of the Optimality Tightening☆37Apr 26, 2017Updated 9 years ago