hakuhodo-technologies / scope-rl
SCOPE-RL: A python library for offline reinforcement learning, off-policy evaluation, and selection
☆133Mar 18, 2024Updated last year
Alternatives and similar repositories for scope-rl
Users that are interested in scope-rl are comparing it to the libraries listed below
- Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation☆691Jun 3, 2024Updated last year
- An easy-to-use ML pipeline package for Python inspired by scikit-learn pipeline and PyTorch layers.☆12Aug 27, 2023Updated 2 years ago
- An offline deep reinforcement learning library☆1,639Sep 10, 2025Updated 5 months ago
- An index of algorithms for offline reinforcement learning (offline-rl)☆1,048May 23, 2024Updated last year
- Official implementation of "Cross-Domain Transfer via Semantic Skill Imitation", Pertsch et al., CoRL 2022☆14Dec 15, 2022Updated 3 years ago
- ☆32Feb 21, 2025Updated 11 months ago
- Clean single-file implementation of offline RL algorithms in JAX☆167Nov 24, 2025Updated 2 months ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago
- ☆18Mar 28, 2023Updated 2 years ago
- ☆20Mar 19, 2024Updated last year
- ☆29Oct 3, 2023Updated 2 years ago
- PyTorchCML is a library of PyTorch implementations of matrix factorization (MF) and collaborative metric learning (CML), algorithms used …☆20Jul 16, 2022Updated 3 years ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆43Aug 22, 2023Updated 2 years ago
- ☆11Oct 19, 2023Updated 2 years ago
- Code for Conformal Counterfactual Inference under Hidden Confounding (KDD’24)☆11Aug 30, 2024Updated last year
- Code for An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality (ICLR 2020)☆11Mar 24, 2023Updated 2 years ago
- Offline Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits☆10Oct 21, 2024Updated last year
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- Code for using the Grasp Affordance Reasoning dataset☆10Sep 17, 2019Updated 6 years ago
- (RecSys2020) "Doubly Robust Estimator for Ranking Metrics with Post-Click Conversions"☆24Mar 25, 2023Updated 2 years ago
- ☆25Apr 14, 2024Updated last year
- Implementation of the work Variational multiple shooting for Bayesian ODEs with Gaussian processes☆13Aug 5, 2022Updated 3 years ago
- (WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model"☆13Jul 16, 2023Updated 2 years ago
- ☆16Jun 30, 2020Updated 5 years ago
- ☆12Aug 13, 2022Updated 3 years ago
- JVRC1 model files for MuJoCo☆10Apr 8, 2025Updated 10 months ago
- Implementation of the BasePlanE models and the experiments from the NeurIPS 2023 paper "PlanE: Representation Learning over Planar Graphs…☆13Jan 27, 2024Updated 2 years ago
- A sandbox about python☆11Nov 16, 2021Updated 4 years ago
- Implementation of BC-IRL and other IRL baselines☆28Jun 6, 2023Updated 2 years ago
- ☆28Sep 12, 2022Updated 3 years ago
- Common utility functions and algorithms for robotics work used by ARC & ARM labs and TRI. This is a mirror of https://github.com/calderpg…☆13Dec 8, 2025Updated 2 months ago
- ☆11Mar 17, 2024Updated last year
- Toolkit of Causal Model-based Reinforcement Learning.☆33Jun 5, 2023Updated 2 years ago
- A web based platform for collecting human actions in reinforcement learning environments☆31Sep 10, 2025Updated 5 months ago
- ☆48Sep 26, 2021Updated 4 years ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆93Dec 1, 2024Updated last year
- Theano☆11Aug 26, 2017Updated 8 years ago
- ☆10Jun 26, 2025Updated 7 months ago
- ☆15Dec 14, 2020Updated 5 years ago