hakuhodo-technologies/scope-rl

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hakuhodo-technologies/scope-rl)

hakuhodo-technologies / scope-rl

SCOPE-RL: A python library for offline reinforcement learning, off-policy evaluation, and selection

☆143

Alternatives and similar repositories for scope-rl

Users that are interested in scope-rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

st-tech / zr-obp
View on GitHub
Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation
☆704Jun 3, 2024Updated 2 years ago
sony / pyIEOE
View on GitHub
☆32Feb 21, 2025Updated last year
taikinman / imker
View on GitHub
An easy-to-use ML pipeline package for Python inspired by scikit-learn pipeline and PyTorch layers.
☆12Aug 27, 2023Updated 2 years ago
takuseno / d3rlpy
View on GitHub
An offline deep reinforcement learning library
☆1,674Sep 10, 2025Updated 10 months ago
gsbDBI / contextual_bandits_evaluation
View on GitHub
Offline Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits
☆11Oct 21, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
hanjuku-kaso / awesome-offline-rl
View on GitHub
An index of algorithms for offline reinforcement learning (offline-rl)
☆1,073May 23, 2024Updated 2 years ago
Lifelong-ML / offline-compositional-rl-datasets
View on GitHub
☆21Mar 19, 2024Updated 2 years ago
usaito / icml2022-mips
View on GitHub
(ICML2022) Off-Policy Evaluation for Large Action Spaces via Embeddings
☆22Jul 27, 2022Updated 3 years ago
d5rlbenchmark / d5rl
View on GitHub
☆31Oct 3, 2023Updated 2 years ago
frt03 / jax_dt
View on GitHub
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18Aug 8, 2022Updated 3 years ago
ghmagazine / ml_design_book
View on GitHub
☆49Sep 26, 2021Updated 4 years ago
smn-ailab / ysaito-qiita
View on GitHub
qiita記事用の実験ファイルや実装したツール群
☆16Apr 1, 2019Updated 7 years ago
ghmagazine / cfml_book
View on GitHub
☆26Apr 14, 2024Updated 2 years ago
aiueola / wsdm2022-cascade-dr
View on GitHub
(WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model"
☆13Jul 16, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
usaito / dr-ranking-metric
View on GitHub
(RecSys2020) "Doubly Robust Estimator for Ranking Metrics with Post-Click Conversions"
☆23Mar 25, 2023Updated 3 years ago
Mathtodon / Contextual_Bandits_Tree
View on GitHub
Contains Code for Contextual Bandits Decision Tree
☆21Jun 11, 2019Updated 7 years ago
ToyotaResearchInstitute / common_robotics_utilities
View on GitHub
Common utility functions and algorithms for robotics work used by ARC & ARM labs and TRI. This is a mirror of https://github.com/calderpg…
☆14Jun 23, 2026Updated last month
kpertsch / star
View on GitHub
Official implementation of "Cross-Domain Transfer via Semantic Skill Imitation", Pertsch et al., CoRL 2022
☆15Dec 15, 2022Updated 3 years ago
rjagerman / sigir2020
View on GitHub
Accelerated Confergence for Counterfactual Learning to Rank
☆17Jan 21, 2022Updated 4 years ago
nissymori / JAX-CORL
View on GitHub
Clean single-file implementation of offline RL algorithms in JAX
☆182Jun 5, 2026Updated last month
hand10ryo / PyTorchCML
View on GitHub
PyTorchCML is a library of PyTorch implementations of matrix factorization (MF) and collaborative metric learning (CML), algorithms used …
☆20Jul 16, 2022Updated 4 years ago
kwakaba / mlpr-class
View on GitHub
☆11Feb 26, 2026Updated 5 months ago
rguo12 / KDD24-Conformal
View on GitHub
Code for Conformal Counterfactual Inference under Hidden Confounding (KDD’24)
☆11Aug 30, 2024Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
daturkel / sd_bandits
View on GitHub
☆15Dec 14, 2020Updated 5 years ago
usaito / recsys2021-tutorial
View on GitHub
https://sites.google.com/cornell.edu/recsys2021tutorial
☆58Mar 21, 2022Updated 4 years ago
tinkoff-ai / cnf
View on GitHub
Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…
☆12Jan 31, 2023Updated 3 years ago
justinjfu / diagnosing_qlearning
View on GitHub
Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.
☆17May 14, 2019Updated 7 years ago
murashitas / iceberg_book_handson
View on GitHub
☆28Oct 13, 2025Updated 9 months ago
corl-team / katakomba
View on GitHub
Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)
☆43Aug 22, 2023Updated 2 years ago
usaito / kdd2022-tutorial
View on GitHub
☆12Aug 13, 2022Updated 3 years ago
RicardDurall / Benchmarking-Strategies-for-Asset-Allocation
View on GitHub
☆27Sep 25, 2022Updated 3 years ago
usaito / unbiased-implicit-rec-real
View on GitHub
(WSDM2020) "Unbiased Recommender Learning from Missing-Not-At-Random Implicit Feedback"
☆30Nov 21, 2022Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
orybkin / lexa-benchmark
View on GitHub
☆42May 11, 2022Updated 4 years ago
ml-feedback-sys / materials-f23
View on GitHub
☆10Nov 15, 2023Updated 2 years ago
banditml / offline-policy-evaluation
View on GitHub
Implementations and examples of common offline policy evaluation methods in Python.
☆220Feb 11, 2023Updated 3 years ago
zhihou7 / dit_policy_vla
View on GitHub
☆16Mar 26, 2025Updated last year
catalyst-team / hydra-slayer
View on GitHub
☆16Jan 4, 2024Updated 2 years ago
nomuramasahir0 / crfmnes
View on GitHub
(CEC2022) Fast Moving Natural Evolution Strategy for High-Dimensional Problems
☆19Apr 13, 2026Updated 3 months ago
facebookresearch / denoised_mdp
View on GitHub
Open source code for paper "Denoised MDPs: Learning World Models Better Than the World Itself"
☆140Aug 15, 2023Updated 2 years ago