theophilegervet / discrete-off-policy-evaluation
Implementation of importance sampling, direct, and hybrid methods for off-policy evaluation.
☆16 Updated 5 years ago
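The three estimator families named in the description can be illustrated with a minimal sketch. This is not the repository's code: it assumes a simplified one-step (contextual bandit) setting with discrete contexts and actions, uses a doubly robust estimator as one common example of a "hybrid" method, and all function names, the `logs` format `(context, action, reward)`, and the policy tables `pi_e` and `pi_b` are hypothetical.

```python
from collections import defaultdict

# Hypothetical data layout (an assumption, not the repository's API):
#   logs  : list of (context, action, reward) tuples collected under pi_b
#   pi_e  : pi_e[x][a] = probability that the target policy picks a in x
#   pi_b  : pi_b[x][a] = probability that the logging policy picks a in x

def importance_sampling(logs, pi_e, pi_b):
    """Mean logged reward, reweighted by pi_e(a|x) / pi_b(a|x)."""
    return sum(pi_e[x][a] / pi_b[x][a] * r for x, a, r in logs) / len(logs)

def fit_reward_model(logs):
    """Tabular reward model: mean logged reward per (context, action)."""
    totals, counts = defaultdict(float), defaultdict(int)
    for x, a, r in logs:
        totals[(x, a)] += r
        counts[(x, a)] += 1
    return {key: totals[key] / counts[key] for key in totals}

def direct_method(logs, pi_e, r_hat):
    """Mean model-predicted value of pi_e over the logged contexts.
    Unseen (context, action) pairs default to a predicted reward of 0."""
    return sum(
        sum(p * r_hat.get((x, a), 0.0) for a, p in enumerate(pi_e[x]))
        for x, _, _ in logs
    ) / len(logs)

def doubly_robust(logs, pi_e, pi_b, r_hat):
    """Direct-method baseline plus an importance-weighted residual."""
    total = 0.0
    for x, a, r in logs:
        baseline = sum(p * r_hat.get((x, b), 0.0) for b, p in enumerate(pi_e[x]))
        weight = pi_e[x][a] / pi_b[x][a]
        total += baseline + weight * (r - r_hat.get((x, a), 0.0))
    return total / len(logs)
```

Importance sampling needs only the two policies, the direct method needs only a reward model, and the doubly robust form combines both so that an error in one can be partly offset by the other. A sequential (MDP) version would apply the weights per step along each trajectory, which this sketch does not attempt.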
Alternatives and similar repositories for discrete-off-policy-evaluation
Users who are interested in discrete-off-policy-evaluation are comparing it to the libraries listed below.
- Structural Causal Bandit ☆25 Updated 4 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments" ☆17 Updated 5 years ago
- Non-stationary Off-policy Evaluation ☆13 Updated 6 years ago
- Code for Quantifying Ignorance in Individual-Level Causal-Effect Estimates under Hidden Confounding ☆23 Updated 2 years ago
- (ICML2022) Off-Policy Evaluation for Large Action Spaces via Embeddings ☆21 Updated 3 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices ☆23 Updated 4 years ago
- Contextual Bandits Action Elimination DQN ☆21 Updated 7 years ago
- ☆88 Updated last year
- Public Release of Plan2vec Implementation in pyTorch ☆57 Updated 2 years ago
- Code for "Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models" (ICML 2019) ☆46 Updated 5 years ago
- Code for "Neural causal learning from unknown interventions" ☆104 Updated 5 years ago
- PreferenceNet: Encoding Human Preferences in Auction Design With Deep Learning ☆17 Updated 4 years ago
- Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions (ICML 2020) ☆43 Updated 2 years ago
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances ☆12 Updated 3 years ago
- Project on Causal Machine learning CS 7290 ☆16 Updated 5 years ago
- Source code for ICLR 2020 paper: "Learning to Guide Random Search" ☆40 Updated last year
- ☆13 Updated 5 months ago
- (RecSys2020) "Doubly Robust Estimator for Ranking Metrics with Post-Click Conversions" ☆24 Updated 2 years ago
- Code for the paper "Optimal Off-Policy Evaluation from Multiple Logging Policies" ☆15 Updated 4 years ago
- ☆51 Updated last year
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post ☆46 Updated 6 years ago
- Code for NeurIPS 2021 paper: "Invariant Causal Imitation Learning for Generalizable Policies" by I. Bica, D. Jarrett, M. van der Schaar ☆28 Updated 3 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions ☆30 Updated 5 years ago
- Code and data for "Deep Reinforcement Learning of Marked Temporal Point Processes", NeurIPS 2018 ☆81 Updated 6 years ago
- A pytorch implementation of Amortized Stein Variational Gradient Descent/ Stein GAN ☆19 Updated 6 years ago
- Estimators to perform off-policy evaluation ☆13 Updated last year
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning ☆34 Updated 5 years ago
- Non stationary bandit for experiments with Reinforcement Learning ☆34 Updated 8 years ago
- Public repository for the work on bandit problems ☆23 Updated last year
- Source code for Hierarchical Probabilistic Forecasting of Electricity Demand with Smart Meter Data by Ben Taieb, Souhaib, Taylor, James, … ☆10 Updated 6 years ago