clinicalml / gumbel-max-scm
Code for "Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models" (ICML 2019)
☆42Updated 4 years ago
Alternatives and similar repositories for gumbel-max-scm:
Users that are interested in gumbel-max-scm are comparing it to the libraries listed below
- Deconfounding Reinforcement Learning in Observational Settings☆50Updated 5 years ago
- Code for "Neural causal learning from unknown interventions"☆99Updated 4 years ago
- OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.☆61Updated 2 years ago
- ☆85Updated 7 months ago
- Learning representations for RL in Healthcare under a POMDP assumption☆52Updated 2 months ago
- Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning☆48Updated 3 years ago
- ☆37Updated 6 years ago
- ☆13Updated 5 years ago
- ☆22Updated last year
- Open AI Gym Environment For MIMIC Dataset Sepsis Patient☆18Updated 2 years ago
- ☆43Updated 2 years ago
- Invariant Causal Prediction for Block MDPs☆44Updated 4 years ago
- ICU-Sepsis is a lightweight, yet challenging RL environment that models the treatment of sepsis in the ICU.☆17Updated 5 months ago
- References at the Intersection of Causality and Reinforcement Learning☆89Updated 4 years ago
- Code for paper Causal Confusion in Imitation Learning☆45Updated 5 years ago
- Reimplementation of NOTEARS in Tensorflow☆35Updated last year
- Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning☆31Updated 7 years ago
- ☆76Updated 3 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆33Updated 5 years ago
- Simulation code for reference with MABUC article: Bareinboim, Forney, & Pearl (2015)☆17Updated 9 years ago
- ☆44Updated 3 years ago
- Official Implementation of the paper "Variational Causal Networks: Approximate Bayesian Inference over Causal Structures"☆17Updated 3 years ago
- Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)☆19Updated 4 years ago
- Structural Causal Bandit☆24Updated 3 years ago
- Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020)☆11Updated 3 years ago
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.☆14Updated 8 months ago
- Code for NeurIPS 2021 paper: "Invariant Causal Imitation Learning for Generalizable Policies" by I. Bica, D. Jarrett, M. van der Schaar☆27Updated 3 years ago
- using information theory to encourage agents to cooperate and compete☆19Updated 6 years ago
- ☆32Updated 6 years ago
- Code Release for Task Agnostic Dynamics Priors for Deep Reinforcement Learning☆12Updated 5 years ago