banditml/offline-policy-evaluation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/banditml/offline-policy-evaluation)

banditml / offline-policy-evaluation

Implementations and examples of common offline policy evaluation methods in Python.

☆220

Alternatives and similar repositories for offline-policy-evaluation

Users that are interested in offline-policy-evaluation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

banditml / banditml
View on GitHub
A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.
☆71Jun 4, 2021Updated 5 years ago
st-tech / zr-obp
View on GitHub
Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation
☆704Jun 3, 2024Updated 2 years ago
usaito / recsys2021-tutorial
View on GitHub
https://sites.google.com/cornell.edu/recsys2021tutorial
☆58Mar 21, 2022Updated 4 years ago
google-research / deep_ope
View on GitHub
☆88Jul 30, 2024Updated last year
aiueola / wsdm2022-cascade-dr
View on GitHub
(WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model"
☆13Jul 16, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
counterfactual-ml / kdd2022-tutorial
View on GitHub
Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances
☆12Aug 14, 2022Updated 3 years ago
david-cortes / contextualbandits
View on GitHub
Python implementations of contextual bandits algorithms
☆838Jun 28, 2026Updated 3 weeks ago
sony / pyIEOE
View on GitHub
☆32Feb 21, 2025Updated last year
MaxHalford / naked
View on GitHub
The simplest way to deploy a machine learning model
☆23Nov 19, 2022Updated 3 years ago
olivierjeunen / pessimism-recsys-2021
View on GitHub
Source code for our paper "Pessimistic Decision-Making for Recommender Systems" published at ACM TORS, and RecSys 2021.
☆11Dec 15, 2022Updated 3 years ago
VowpalWabbit / estimators
View on GitHub
Estimators to perform off-policy evaluation
☆13Sep 3, 2024Updated last year
olivierjeunen / EARS-recsys-2021
View on GitHub
Source code for our paper "Top-K Contextual Bandits with Equity of Exposure" published at RecSys 2021.
☆15Aug 2, 2021Updated 4 years ago
facebookresearch / ReAgent
View on GitHub
A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
☆3,708Updated this week
daturkel / sd_bandits
View on GitHub
☆15Dec 14, 2020Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
hanjuku-kaso / awesome-offline-rl
View on GitHub
An index of algorithms for offline reinforcement learning (offline-rl)
☆1,073May 23, 2024Updated 2 years ago
fidelity / mabwiser
View on GitHub
MABWiser: Contextual Multi-Armed Bandits Library
☆287Sep 5, 2024Updated last year
rjagerman / sigir2020
View on GitHub
Accelerated Confergence for Counterfactual Learning to Rank
☆17Jan 21, 2022Updated 4 years ago
gsbDBI / contextual_bandits_evaluation
View on GitHub
Offline Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits
☆11Oct 21, 2024Updated last year
theophilegervet / discrete-off-policy-evaluation
View on GitHub
Implementation of importance sampling, direct, and hybrid methods for off-policy evaluation.
☆16Mar 28, 2020Updated 6 years ago
SMPyBandits / SMPyBandits
View on GitHub
🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorith…
☆424Jun 19, 2026Updated last month
peterhurford / vowpal_platypus
View on GitHub
Fast, accurate, lightweight, multi-core ML in Python, leveraging Vowpal Wabbit
☆21May 26, 2018Updated 8 years ago
tlentali / leab
View on GitHub
📈🔍 Lets Python do AB testing analysis.
☆75Apr 15, 2025Updated last year
google-research / recsim_ng
View on GitHub
RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems
☆127Apr 26, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Kenza-AI / mab-ranking
View on GitHub
Online Ranking with Multi-Armed-Bandits
☆19Sep 4, 2021Updated 4 years ago
HCDM / BanditLib
View on GitHub
Library of contextual bandits algorithms
☆343Mar 14, 2024Updated 2 years ago
MehdiZouitine / gym_ma_toy
View on GitHub
Toy environment set for multi-agent reinforcement learning and more
☆38Nov 26, 2024Updated last year
ntucllab / striatum
View on GitHub
Contextual bandit in python
☆112Jul 7, 2021Updated 5 years ago
VowpalWabbit / coba
View on GitHub
Contextual bandit benchmarking
☆53Jan 21, 2026Updated 6 months ago
clvoloshin / COBS
View on GitHub
OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.
☆61Aug 9, 2022Updated 3 years ago
irecsys / Tutorial_MSRS
View on GitHub
Tutorial for Multi-Stakeholder Recommender Systems
☆22Aug 23, 2021Updated 4 years ago
criteo-research / bandit-reco
View on GitHub
☆51Jan 3, 2021Updated 5 years ago
hakuhodo-technologies / scope-rl
View on GitHub
SCOPE-RL: A python library for offline reinforcement learning, off-policy evaluation, and selection
☆143Mar 18, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
carbonfact / munpack
View on GitHub
📊 Explain why metrics change by unpacking them
☆42Jan 16, 2026Updated 6 months ago
victusfate / concierge
View on GitHub
real time recommendation playground
☆15Nov 7, 2022Updated 3 years ago
BestActionNow / Slate_Aware_Ranking
View on GitHub
The implementation for our paper "Slate-Aware Ranking for Recommendation" accepted by WSDM.23
☆16Dec 13, 2022Updated 3 years ago
EleutherAI / equivariance
View on GitHub
A framework for implementing equivariant DL
☆10May 25, 2021Updated 5 years ago
ma3oun / hrn
View on GitHub
Hash-routed Networks
☆19Nov 20, 2020Updated 5 years ago
ExpediaGroup / map-maker
View on GitHub
Map maker is a command line tool and library for easily generating maps from structured data.
☆16Mar 5, 2024Updated 2 years ago
usaito / kdd2022-tutorial
View on GitHub
☆12Aug 13, 2022Updated 3 years ago