clinicalml / gumbel-max-scm
Code for "Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models" (ICML 2019)
☆42Updated 4 years ago
Alternatives and similar repositories for gumbel-max-scm:
Users that are interested in gumbel-max-scm are comparing it to the libraries listed below
- Deconfounding Reinforcement Learning in Observational Settings☆49Updated 5 years ago
- Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning☆48Updated 3 years ago
- OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.☆61Updated 2 years ago
- Invariant Causal Prediction for Block MDPs☆44Updated 4 years ago
- Code for "Neural causal learning from unknown interventions"☆100Updated 4 years ago
- Official implementation of Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning at ICML…☆37Updated 3 years ago
- Code for paper Causal Confusion in Imitation Learning☆44Updated 5 years ago
- ☆43Updated 2 years ago
- Learning representations for RL in Healthcare under a POMDP assumption☆53Updated last month
- ☆85Updated 6 months ago
- ☆13Updated 5 years ago
- ☆22Updated last year
- ☆37Updated 6 years ago
- (ICML2022) Off-Policy Evaluation for Large Action Spaces via Embeddings☆20Updated 2 years ago
- Open AI Gym Environment For MIMIC Dataset Sepsis Patient☆18Updated 2 years ago
- ICU-Sepsis is a lightweight, yet challenging RL environment that models the treatment of sepsis in the ICU.☆14Updated 3 months ago
- References at the Intersection of Causality and Reinforcement Learning☆88Updated 4 years ago
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.☆14Updated 7 months ago
- Dead-ends and Secure Exploration in Reinforcement Learning☆11Updated 5 years ago
- Generalised UDRL☆37Updated 2 years ago
- Code for ICLR 2020 paper: "Estimating counterfactual treatment outcomes over time through adversarially balanced representations" by I. B…☆58Updated 10 months ago
- ☆26Updated 5 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆33Updated 5 years ago
- Simulation code for reference with MABUC article: Bareinboim, Forney, & Pearl (2015)☆17Updated 9 years ago
- Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.☆19Updated 5 years ago
- Official Implementation of the paper "Variational Causal Networks: Approximate Bayesian Inference over Causal Structures"☆17Updated 3 years ago
- using information theory to encourage agents to cooperate and compete☆19Updated 6 years ago
- Safe Policy Improvement with Baseline Bootstrapping☆26Updated 4 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- ☆43Updated 3 years ago