vub-ai-lab/bdpi

Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration

☆25

Alternatives and similar repositories for bdpi

Users who are interested in bdpi are comparing it to the libraries listed below.

voot-t / guide-actor-critic
Keras implementation of guide actor-critic for continuous control
☆11 · Mar 12, 2018 · Updated 8 years ago
msu-anthropology / indian-country-ss18
This Is Indian Country - Spring 2018 Instance
☆12 · Apr 30, 2018 · Updated 8 years ago
plibin / epi-rl
☆14 · Jun 21, 2024 · Updated 2 years ago
illidanlab / rpg
Ranking Policy Gradient
☆23 · Nov 27, 2019 · Updated 6 years ago
RLAgent / state-marginal-matching
Efficient Exploration via State Marginal Matching (2019)
☆70 · Jun 30, 2019 · Updated 7 years ago
pfnet-research / capg
Implementation of clipped action policy gradient (CAPG) with PPO and TRPO
☆31 · Jun 24, 2018 · Updated 8 years ago
deep-skill-chaining / deep-skill-chaining
Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"
☆30 · Sep 24, 2019 · Updated 6 years ago
sparisi / td-reg
TD-Regularized Actor-Critic Methods
☆37 · Dec 26, 2019 · Updated 6 years ago
mingzhangPHD / Adversarial-Imitation-Learning
Wasserstein Distance guided Adversarial Imitation Learning (WDAIL) with Reward Shape Exploration
☆19 · Feb 9, 2021 · Updated 5 years ago
boschresearch / DD_OPG
Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.
☆11 · Jun 12, 2019 · Updated 7 years ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32 · Feb 21, 2020 · Updated 6 years ago
google-deepmind / egg
☆19 · Apr 15, 2026 · Updated 3 months ago
mcgillmrl / robot_learning
ROS package for robot learning
☆17 · Oct 16, 2019 · Updated 6 years ago
behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24 · Jun 24, 2020 · Updated 6 years ago
facebookresearch / slbo
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆94 · Sep 13, 2019 · Updated 6 years ago
arnomoonens / yarll
Combining deep learning and reinforcement learning.
☆81 · Jun 6, 2026 · Updated last month
jkulhanek / gym-deepmindlab-env
Gym implementation of connector to Deepmind lab
☆12 · Mar 26, 2019 · Updated 7 years ago
jvmncs / ParamNoise
A comparison of parameter space noise methods for exploration in deep reinforcement learning
☆30 · Mar 14, 2019 · Updated 7 years ago
RockySJ / ampo
☆15 · Oct 20, 2020 · Updated 5 years ago
philip-w-howard / stackl
stack based virtual machine interpreter and a C compiler
☆12 · May 9, 2025 · Updated last year
xbpeng / awr
Implementation of advantage-weighted regression.
☆211 · May 30, 2020 · Updated 6 years ago
Breakend / ReproducibilityInContinuousPolicyGradientMethods
These are experiments for examining reproducibility in Policy Gradient RL algorithms in Continuous domains. Mainly using the Rllab implem…
☆17 · Sep 20, 2017 · Updated 8 years ago
oxwhirl / opiq
Code for Optimistic Exploration even with a Pessimistic Initialisation
☆14 · Aug 4, 2020 · Updated 5 years ago
microsoft / oac-explore
Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)
☆70 · Aug 11, 2023 · Updated 2 years ago
RLG-Leiden / edugym
☆15 · Sep 22, 2023 · Updated 2 years ago
CarperAI / nmmo-environment
Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research
☆15 · May 30, 2024 · Updated 2 years ago
cosmoharrigan / rc-nfq
RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…
☆12 · Mar 17, 2021 · Updated 5 years ago
lafmdp / HIDIL
[NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"
☆12 · Nov 24, 2021 · Updated 4 years ago
febert / robustness_via_retrying
Robustness via Retrying: Closed-Loop Robotic Manipulation with Self-Supervised Learning
☆16 · Nov 7, 2018 · Updated 7 years ago
ehknight / natural-gradient-deep-q-learning
☆23 · Oct 7, 2018 · Updated 7 years ago
miyosuda / rodentia
3D learning environment with rigid body simulation for Linux/MacOSX
☆14 · Dec 24, 2021 · Updated 4 years ago
Innixma / dex
Continual Learning Toolkit for Reinforcement Learning
☆21 · Jan 28, 2018 · Updated 8 years ago
wangyuhuix / TrulyPPO
☆29 · Nov 21, 2022 · Updated 3 years ago
lns / dapo
Source code for the paper "Divergence-Augmented Policy Optimization"
☆37 · Nov 28, 2019 · Updated 6 years ago
iffyio / pong.hs
A pong game written in haskell
☆17 · Sep 18, 2016 · Updated 9 years ago
mgualti / DeepRLManip
Code for "Learning 6-DoF Grasping and Pick-Place Using Attention Focus"
☆22 · Sep 21, 2018 · Updated 7 years ago
ShibiHe / Q-Optimality-Tightening
This is my implementation of the Optimality Tightening
☆37 · Apr 26, 2017 · Updated 9 years ago
gioramponi / sigma-girl-MIIRL
Code of Truly Batch Model-Free Inverse Reinforcement Learning about Multiple Intentions
☆13 · May 22, 2023 · Updated 3 years ago
DavidJanz / successor_uncertainties_atari
Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…
☆21 · Feb 24, 2023 · Updated 3 years ago