RomainLaroche/SPIBB

Safe Policy Improvement with Baseline Bootstrapping

☆26

Alternatives and similar repositories for SPIBB

Users interested in SPIBB are comparing it to the repositories listed below.

rems75 / SPIBB-DQN
Code for SPIBB-DQN and Soft-SPIBB-DQN
☆11 · May 5, 2020 · Updated 6 years ago
dtak / POPCORN-POMDP
Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020)
☆11 · May 19, 2021 · Updated 5 years ago
zackchase / intrinsic-fear-dqn
Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.
☆10 · Nov 13, 2017 · Updated 8 years ago
zhengwang125 / RLTS
☆10 · Jul 23, 2021 · Updated 5 years ago
VowpalWabbit / estimators
Estimators to perform off-policy evaluation
☆13 · Sep 3, 2024 · Updated last year
cjlovering / interpretable-reinforcement-learning-using-attention
[NeurIPS, 2020 - Reproducibility Challenge]: [RE] Towards Interpretable Reinforcement Learning Using Attention Augmented Agents
☆13 · Apr 26, 2021 · Updated 5 years ago
StanfordASL / safe_traffic_weaving
On Infusing Reachability-Based Safety Assurance within Probabilistic Planning Frameworks for Human-Robot Vehicle Interactions
☆18 · Jul 10, 2020 · Updated 6 years ago
RonanFR / UCRL
☆27 · May 17, 2019 · Updated 7 years ago
clvoloshin / COBS
OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.
☆61 · Aug 9, 2022 · Updated 3 years ago
suyoung-lee / Episodic-Backward-Update
Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.
☆16 · Sep 24, 2019 · Updated 6 years ago
rjagerman / wsdm2019-nonstationary
Non-stationary Off-policy Evaluation
☆13 · Nov 8, 2018 · Updated 7 years ago
ZishunYu / Actor-Critic-Alignment
Implementation of "Actor-Critic Alignment for Offline-to-Online Reinforcement Learning"
☆13 · Oct 12, 2023 · Updated 2 years ago
justinjfu / diagnosing_qlearning
Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.
☆17 · May 14, 2019 · Updated 7 years ago
clinicalml / gumbel-max-scm
Code for "Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models" (ICML 2019)
☆48 · Sep 28, 2020 · Updated 5 years ago
frt03 / mxt_bench
A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation (ICLR 2023)
☆14 · Feb 3, 2023 · Updated 3 years ago
microsoft / oac-explore
Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)
☆70 · Aug 11, 2023 · Updated 2 years ago
sfujim / LAP-PAL
Author's PyTorch implementation of LAP and PAL with TD3 and DDQN
☆41 · Dec 7, 2021 · Updated 4 years ago
ermongroup / CalibratedModelBasedRL
Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.
☆54 · May 15, 2019 · Updated 7 years ago
sisl / AutomotiveSafeRL
Training and evaluation scripts for applying formal methods and reinforcement learning to autonomous driving problems.
☆26 · Feb 21, 2020 · Updated 6 years ago
google-research / batch_rl
Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games
☆560 · Jun 26, 2023 · Updated 3 years ago
microsoft / StateDecoding
Reinforcement Learning via Latent State Decoding
☆29 · Jun 12, 2023 · Updated 3 years ago
secury / optidice
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
☆16 · Aug 3, 2023 · Updated 2 years ago
yilundu / task_agnostic_dynamics_prior
Code Release for Task Agnostic Dynamics Priors for Deep Reinforcement Learning
☆12 · Jun 13, 2019 · Updated 7 years ago
sfujim / BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
☆667 · Apr 6, 2021 · Updated 5 years ago
breuleux / hrepr
HTML representation for Python objects.
☆17 · Apr 16, 2025 · Updated last year
KAIST-AILab / gmmil
Contains an implementation of "Imitation Learning via Kernel Mean Embedding" (AAAI 2018)
☆11 · Oct 2, 2018 · Updated 7 years ago
laventura / carnd.path.planning
Path-Planning for Self-Driving Car. Implemented a behavior planner in C++. Project for Udacity Self-Driving Car Nanodegree.
☆11 · Aug 11, 2017 · Updated 8 years ago
randriu / paynt
☆23 · Updated this week
lafmdp / HIDIL
[NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"
☆12 · Nov 24, 2021 · Updated 4 years ago
strumswell / twitter-follower-graph
Twitter follower graphs of @Die_Gruenen & @AfD, including cluster and topic analysis
☆10 · Jul 10, 2020 · Updated 6 years ago
lionelblonde / sam-pytorch
PyTorch implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"
☆10 · Nov 22, 2019 · Updated 6 years ago
jasonroy0 / BNP-short-course
☆15 · Jul 24, 2018 · Updated 8 years ago
gioramponi / sigma-girl-MIIRL
Code for "Truly Batch Model-Free Inverse Reinforcement Learning about Multiple Intentions"
☆13 · May 22, 2023 · Updated 3 years ago
zygmuntz / metric-learning-for-regression
Applying metric learning to kin8nm
☆16 · Nov 10, 2014 · Updated 11 years ago
LAVA-LAB / safe-slac
Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.
☆11 · Mar 1, 2023 · Updated 3 years ago
abbyvansoest / maxent
☆14 · May 30, 2019 · Updated 7 years ago
AdityaMate / collapsing_bandits
Code repo for "Collapsing Bandits and Their Applications to Public Health Interventions" (NeurIPS'20)
☆11 · Dec 3, 2025 · Updated 7 months ago
alainray / causal_inference
Repository for my studies of Causal Inference
☆10 · Dec 1, 2019 · Updated 6 years ago
Hyeokreal / ali_bigan_mnist_pytorch
☆10 · Aug 8, 2017 · Updated 8 years ago