google-research/deep_ope

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/google-research/deep_ope)

google-research / deep_ope

☆88

Alternatives and similar repositories for deep_ope

Users that are interested in deep_ope are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

clvoloshin / COBS
View on GitHub
OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.
☆61Aug 9, 2022Updated 3 years ago
counterfactual-ml / kdd2022-tutorial
View on GitHub
Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances
☆12Aug 14, 2022Updated 3 years ago
daturkel / sd_bandits
View on GitHub
☆15Dec 14, 2020Updated 5 years ago
aviralkumar2907 / CQL
View on GitHub
Code for conservative Q-learning
☆488Dec 7, 2021Updated 4 years ago
tianheyu927 / mopo
View on GitHub
Code for MOPO: Model-based Offline Policy Optimization
☆191May 17, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
adith387 / slates_semisynth_expts
View on GitHub
Semi-synthetic experiments to test several approaches for off-policy evaluation and optimization of slate recommenders.
☆43Nov 2, 2017Updated 8 years ago
justinjfu / diagnosing_qlearning
View on GitHub
Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.
☆17May 14, 2019Updated 7 years ago
matsuolab / BREMEN
View on GitHub
Codebase of Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization (ICLR2021)
☆54Jul 7, 2021Updated 5 years ago
google-research / dice_rl
View on GitHub
☆114Jul 3, 2026Updated 2 weeks ago
olivierjeunen / pessimism-recsys-2021
View on GitHub
Source code for our paper "Pessimistic Decision-Making for Recommender Systems" published at ACM TORS, and RecSys 2021.
☆11Dec 15, 2022Updated 3 years ago
yihaosun1124 / pytorch-mopo
View on GitHub
re-implementation of the offline model-based RL algorithm MOPO in pytorch
☆26Feb 28, 2022Updated 4 years ago
sony / pyIEOE
View on GitHub
☆32Feb 21, 2025Updated last year
VowpalWabbit / estimators
View on GitHub
Estimators to perform off-policy evaluation
☆13Sep 3, 2024Updated last year
olivierjeunen / EARS-recsys-2021
View on GitHub
Source code for our paper "Top-K Contextual Bandits with Equity of Exposure" published at RecSys 2021.
☆15Aug 2, 2021Updated 4 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
hanjuku-kaso / awesome-offline-rl
View on GitHub
An index of algorithms for offline reinforcement learning (offline-rl)
☆1,073May 23, 2024Updated 2 years ago
Farama-Foundation / D4RL-Evaluations
View on GitHub
☆203Mar 25, 2023Updated 3 years ago
facebookresearch / icp-block-mdp
View on GitHub
Invariant Causal Prediction for Block MDPs
☆44Jun 11, 2020Updated 6 years ago
aiueola / wsdm2022-cascade-dr
View on GitHub
(WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model"
☆13Jul 16, 2023Updated 3 years ago
Farama-Foundation / D4RL
View on GitHub
A collection of reference environments for offline reinforcement learning
☆1,695Nov 18, 2024Updated last year
google-research / batch_rl
View on GitHub
Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games
☆560Jun 26, 2023Updated 3 years ago
denisyarats / exorl
View on GitHub
ExORL: Exploratory Data for Offline Reinforcement Learning
☆137Feb 8, 2022Updated 4 years ago
apple / ml-uwac
View on GitHub
☆35Jul 10, 2021Updated 5 years ago
sgiguere / RobinHood-NeurIPS-2019
View on GitHub
Implementation of safe offline bandit algorithms.
☆10Oct 27, 2019Updated 6 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
avisingh599 / cog
View on GitHub
[CoRL 2020] COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning
☆35Oct 28, 2020Updated 5 years ago
usaito / recsys2021-tutorial
View on GitHub
https://sites.google.com/cornell.edu/recsys2021tutorial
☆58Mar 21, 2022Updated 4 years ago
KyriacosShiarli / taco
View on GitHub
☆25Jan 2, 2019Updated 7 years ago
sfujim / BCQ
View on GitHub
Author's PyTorch implementation of BCQ for continuous and discrete actions
☆667Apr 6, 2021Updated 5 years ago
Sea-Snell / CALM-Dialogue
View on GitHub
Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"
☆34Dec 9, 2022Updated 3 years ago
SwapnilPande / MOReL
View on GitHub
Model-Based Offline Reinforcement Learning
☆51Jan 13, 2021Updated 5 years ago
joelouismarino / variational_rl
View on GitHub
Variational Reinforcement Learning
☆17Jul 25, 2024Updated last year
HumanCompatibleAI / eirli
View on GitHub
An Empirical Investigation of Representation Learning for Imitation (EIRLI), NeurIPS'21
☆37Mar 4, 2023Updated 3 years ago
sfujim / TD3_BC
View on GitHub
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
☆410Dec 18, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
banditml / offline-policy-evaluation
View on GitHub
Implementations and examples of common offline policy evaluation methods in Python.
☆220Feb 11, 2023Updated 3 years ago
usaito / icml2022-mips
View on GitHub
(ICML2022) Off-Policy Evaluation for Large Action Spaces via Embeddings
☆22Jul 27, 2022Updated 3 years ago
google-research / realworldrl_suite
View on GitHub
Real-World RL Benchmark Suite
☆365Aug 11, 2020Updated 5 years ago
MLD3 / OfflineRL_ModelSelection
View on GitHub
[MLHC 2021] Model Selection for Offline RL: Practical Considerations for Healthcare Settings. https://arxiv.org/abs/2107.11003
☆11Oct 6, 2022Updated 3 years ago
HxLyn3 / Machine-Learning
View on GitHub
Some notes and solutions to "Machine Learning" authored by Zhi-Hua Zhou
☆11Jul 20, 2021Updated 5 years ago
HxLyn3 / MPPVE
View on GitHub
☆10Sep 19, 2023Updated 2 years ago
CausalML / MultipleLoggers
View on GitHub
Code for the paper "Optimal Off-Policy Evaluation from Multiple Logging Policies"
☆15Jul 17, 2021Updated 5 years ago