clvoloshin / COBSLinks
OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.
☆61Updated 2 years ago
Alternatives and similar repositories for COBS
Users that are interested in COBS are comparing it to the libraries listed below
Sorting:
- ☆86Updated 10 months ago
- ☆41Updated 3 years ago
- ☆27Updated 5 years ago
- ☆26Updated 2 years ago
- Model-Based Offline Reinforcement Learning☆50Updated 4 years ago
- ☆104Updated 10 months ago
- Code for MOPO: Model-based Offline Policy Optimization☆179Updated 3 years ago
- Code for paper Causal Confusion in Imitation Learning☆45Updated 5 years ago
- Invariant Causal Prediction for Block MDPs☆44Updated 5 years ago
- ☆32Updated 2 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Updated 6 years ago
- ☆83Updated 4 years ago
- ☆31Updated 5 years ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction☆161Updated 4 years ago
- ☆61Updated 7 years ago
- A list of papers regarding generalization in (deep) reinforcement learning☆151Updated last year
- Official implementation of Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning at ICML…☆39Updated 4 years ago
- ☆198Updated 2 years ago
- Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL☆25Updated 3 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆86Updated 4 years ago
- ☆17Updated 2 years ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆27Updated 3 years ago
- ☆53Updated last year
- Safe Policy Improvement with Baseline Bootstrapping☆26Updated 5 years ago
- Learning Laplacian Representations in Reinforcement Learning☆16Updated 4 years ago
- Revisiting Rainbow☆75Updated 4 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆49Updated 3 years ago
- Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.☆19Updated 6 years ago
- ☆43Updated 6 years ago
- Conservative Q Learning on top of SAC☆131Updated 2 years ago