SCOPE-RL: A Python library for offline reinforcement learning, off-policy evaluation, and selection
☆141Mar 18, 2024Updated 2 years ago
Alternatives and similar repositories for scope-rl
Users that are interested in scope-rl are comparing it to the libraries listed below.
- Open Bandit Pipeline: a Python library for bandit algorithms and off-policy evaluation☆706Jun 3, 2024Updated last year
- ☆32Feb 21, 2025Updated last year
- An easy-to-use ML pipeline package for Python inspired by scikit-learn pipeline and PyTorch layers.☆12Aug 27, 2023Updated 2 years ago
- An offline deep reinforcement learning library☆1,659Sep 10, 2025Updated 7 months ago
- Offline Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits☆11Oct 21, 2024Updated last year
- An index of algorithms for offline reinforcement learning (offline-rl)☆1,065May 23, 2024Updated last year
- ☆21Mar 19, 2024Updated 2 years ago
- ☆30Oct 3, 2023Updated 2 years ago
- (ICML2022) Off-Policy Evaluation for Large Action Spaces via Embeddings☆22Jul 27, 2022Updated 3 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆18Aug 8, 2022Updated 3 years ago
- ☆49Sep 26, 2021Updated 4 years ago
- Experiment files and implemented tools for Qiita articles☆16Apr 1, 2019Updated 7 years ago
- (WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model"☆13Jul 16, 2023Updated 2 years ago
- ☆27Apr 14, 2024Updated 2 years ago
- (RecSys2020) "Doubly Robust Estimator for Ranking Metrics with Post-Click Conversions"☆23Mar 25, 2023Updated 3 years ago
- Contains Code for Contextual Bandits Decision Tree☆21Jun 11, 2019Updated 6 years ago
- Official implementation of "Cross-Domain Transfer via Semantic Skill Imitation", Pertsch et al., CoRL 2022☆15Dec 15, 2022Updated 3 years ago
- Clean single-file implementation of offline RL algorithms in JAX☆175Nov 24, 2025Updated 5 months ago
- Accelerated Convergence for Counterfactual Learning to Rank☆17Jan 21, 2022Updated 4 years ago
- PyTorchCML is a library of PyTorch implementations of matrix factorization (MF) and collaborative metric learning (CML), algorithms used …☆20Jul 16, 2022Updated 3 years ago
- Code for Conformal Counterfactual Inference under Hidden Confounding (KDD’24)☆11Aug 30, 2024Updated last year
- ☆11Feb 26, 2026Updated 2 months ago
- ☆15Dec 14, 2020Updated 5 years ago
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- Metadata browser of TREC☆10Updated this week
- ☆12Aug 13, 2022Updated 3 years ago
- (WSDM2020) "Unbiased Recommender Learning from Missing-Not-At-Random Implicit Feedback"☆30Nov 21, 2022Updated 3 years ago
- ☆42May 11, 2022Updated 3 years ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆43Aug 22, 2023Updated 2 years ago
- ☆27Sep 25, 2022Updated 3 years ago
- Implementations and examples of common offline policy evaluation methods in Python.☆224Feb 11, 2023Updated 3 years ago
- ☆10Nov 15, 2023Updated 2 years ago
- Official PyTorch implementation of "Rethinking Value Function Learning for Generalization in Reinforcement Learning" (NeurIPS 2022)☆15Feb 20, 2023Updated 3 years ago
- Implementation of the work Variational multiple shooting for Bayesian ODEs with Gaussian processes☆14Aug 5, 2022Updated 3 years ago
- ☆16Jan 4, 2024Updated 2 years ago
- Open source code for paper "Denoised MDPs: Learning World Models Better Than the World Itself"☆139Aug 15, 2023Updated 2 years ago
- Uses simple Bayesian conjugate prior update rules to calculate metrics for various marketing objectives☆11Oct 9, 2023Updated 2 years ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)☆33Jun 3, 2023Updated 2 years ago
- Theano☆11Aug 26, 2017Updated 8 years ago