Code for "Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models" (ICML 2019)
☆47Sep 28, 2020Updated 5 years ago
Alternatives and similar repositories for gumbel-max-scm
Users that are interested in gumbel-max-scm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Learning representations for RL in Healthcare under a POMDP assumption☆58Jan 21, 2025Updated last year
- [NeurIPS 2022] Leveraging Factored Action Spaces for Efficient Offline RL in Healthcare. https://arxiv.org/abs/2305.01738☆11Nov 27, 2022Updated 3 years ago
- Code for "Trajectory Inspection: A Method for Iterative Clinician-Driven Design of Reinforcement Learning Studies"☆16Oct 15, 2020Updated 5 years ago
- Official Implementation of the paper "Variational Causal Networks: Approximate Bayesian Inference over Causal Structures"☆17Nov 19, 2021Updated 4 years ago
- Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020)☆11May 19, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Sepsis cohort from MIMIC dataset☆128Jul 6, 2023Updated 2 years ago
- Causal data augmentation for pretraining debiasing☆11Aug 31, 2021Updated 4 years ago
- ☆22Oct 4, 2019Updated 6 years ago
- Safe Policy Improvement with Baseline Bootstrapping☆26May 5, 2020Updated 6 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Sep 24, 2019Updated 6 years ago
- OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.☆63Aug 9, 2022Updated 3 years ago
- Reinforcement learning for medical decisions☆129Dec 17, 2019Updated 6 years ago
- ☆24May 13, 2018Updated 7 years ago
- ☆15May 15, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Comp 781 Project☆10Jan 2, 2026Updated 4 months ago
- Estimators to perform off-policy evaluation☆13Sep 3, 2024Updated last year
- ☆17Mar 21, 2017Updated 9 years ago
- Active Learning with Partial Feedback, ICLR 2019☆11Apr 27, 2020Updated 6 years ago
- Open AI Gym Environment For MIMIC Dataset Sepsis Patient☆24Dec 8, 2022Updated 3 years ago
- Deconfounding Reinforcement Learning in Observational Settings☆52Apr 13, 2019Updated 7 years ago
- Causal Effect Inference for Structured Treatments (SIN) (NeurIPS 2021)☆42Apr 26, 2022Updated 4 years ago
- ☆27Oct 25, 2019Updated 6 years ago
- Code to reproduce our paper on probabilistic algorithmic recourse: https://arxiv.org/abs/2006.06831☆37Dec 27, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- (ICML2022) Off-Policy Evaluation for Large Action Spaces via Embeddings☆22Jul 27, 2022Updated 3 years ago
- A Recurrent Latent Variable Model for Sequential Data☆28May 8, 2018Updated 8 years ago
- Offline Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits☆11Oct 21, 2024Updated last year
- Modelling the Multiwavelength Variability of Mrk-335 using Gaussian processes☆12May 30, 2022Updated 3 years ago
- Repository for Deep Structural Causal Models for Tractable Counterfactual Inference☆298Jul 6, 2023Updated 2 years ago
- Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data"☆14Feb 16, 2021Updated 5 years ago
- This repository contains the code used in the paper Evaluating the Performance of Reinformcent Learning Algorithms☆27Aug 14, 2021Updated 4 years ago
- Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt…☆38Aug 11, 2024Updated last year
- Evaluating Durability: Benchmark Insights into Multimodal Watermarking☆12Jun 7, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆22Oct 26, 2018Updated 7 years ago
- General-purpose library for extracting interpretable models from Multi-Agent Reinforcement Learning systems☆22May 10, 2020Updated 5 years ago
- ☆14Oct 29, 2018Updated 7 years ago
- Simulation code for reference with MABUC article: Bareinboim, Forney, & Pearl (2015)☆18Nov 11, 2015Updated 10 years ago
- ☆18Apr 25, 2023Updated 3 years ago
- Time-Aware Transformer-based Network for Clinical Notes Series Prediction☆24Jan 20, 2024Updated 2 years ago
- Decoupling Dynamics and Reward for Transfer Learning☆16Sep 7, 2018Updated 7 years ago