clvoloshin / COBSLinks

OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.

☆61

Alternatives and similar repositories for COBS

Users that are interested in COBS are comparing it to the libraries listed below

Sorting:

google-research / deep_ope
☆86Updated 11 months ago
davidbrandfonbrener / onestep-rl
☆41Updated 3 years ago
tianheyu927 / mopo
Code for MOPO: Model-based Offline Policy Optimization
☆179Updated 3 years ago
Farama-Foundation / D4RL-Evaluations
☆198Updated 2 years ago
dennisl88 / rand_param_envs
Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7
☆20Updated 6 years ago
spitis / mrl
☆113Updated 2 years ago
aviralkumar2907 / BEAR
Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction
☆161Updated 4 years ago
SwapnilPande / MOReL
Model-Based Offline Reinforcement Learning
☆50Updated 4 years ago
MadryLab / implementation-matters
☆132Updated 11 months ago
eric-mitchell / macaw
Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]
☆47Updated 2 years ago
lanyavik / BAIL
☆17Updated 3 years ago
snu-mllab / EDAC
Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)
☆75Updated 2 years ago
thanard / me-trpo
☆92Updated last year
yifan12wu / rl-laplacian
Learning Laplacian Representations in Reinforcement Learning
☆16Updated 4 years ago
facebookresearch / deep_bisim4control
Learning Invariant Representations for Reinforcement Learning without Reconstruction
☆148Updated 3 years ago
kaixin96 / rl-generalization-paper
A list of papers regarding generalization in (deep) reinforcement learning
☆151Updated last year
mcmachado / count_based_exploration_sr
☆31Updated 6 years ago
Ji4chenLi / Multi-Task-Batch-RL
☆26Updated 2 years ago
facebookresearch / level-replay
This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …
☆87Updated 4 years ago
robintyh1 / onpolicybaselines
on-policy optimization baselines for deep reinforcement learning
☆30Updated 5 years ago
facebookresearch / icp-block-mdp
Invariant Causal Prediction for Block MDPs
☆44Updated 5 years ago
nuria95 / O-RAAC
Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting
☆35Updated 4 years ago
hiwonjoon / ICML2019-TREX
☆83Updated 4 years ago
suyoung-lee / Episodic-Backward-Update
Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.
☆17Updated 5 years ago
RomainLaroche / SPIBB
Safe Policy Improvement with Baseline Bootstrapping
☆26Updated 5 years ago
lmzintgraf / varibad
Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)
☆191Updated 2 years ago
ml-jku / OfflineRL
☆31Updated 2 years ago
rraileanu / idaac
☆54Updated last year
pokaxpoka / sunrise
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
☆125Updated 4 years ago
ben-eysenbach / sac
Soft Actor-Critic
☆151Updated 7 years ago