hakuhodo-technologies / scope-rl
SCOPE-RL: A python library for offline reinforcement learning, off-policy evaluation, and selection
☆120Updated 11 months ago
Alternatives and similar repositories for scope-rl:
Users that are interested in scope-rl are comparing it to the libraries listed below
- An out-of-the-box GUI tool for offline deep reinforcement learning☆99Updated 3 years ago
- ☆31Updated 3 weeks ago
- Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google)☆113Updated 6 months ago
- ☆85Updated 7 months ago
- Deep reinforcement learning with tensorflow2☆93Updated 2 weeks ago
- 勉強した内容のアウトプット用☆46Updated this week
- ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives (Deep RL Workshop 2021)☆46Updated 3 years ago
- Datasets for data-driven deep reinforcement learning with PyBullet environments☆148Updated 3 years ago
- Fast Flexible Replay Buffer Library (Mirror repository of https://gitlab.com/ymd_h/cpprb)☆72Updated 3 months ago
- Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation☆656Updated 9 months ago
- Simple Distributed Reinforcement Learning Framework(シンプルな分散強化学習フレームワーク)☆47Updated this week
- Deep reinforcement learning library built on top of Neural Network Libraries☆123Updated 2 months ago
- An easy-to-use reinforcement learning library for research and education.☆166Updated last week
- (ICML2022) Off-Policy Evaluation for Large Action Spaces via Embeddings☆20Updated 2 years ago
- Codebase of Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization (ICLR2021)☆51Updated 3 years ago
- A collection of RL algorithms written in JAX.☆95Updated 2 years ago
- HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own…☆287Updated 2 weeks ago
- OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.☆61Updated 2 years ago
- Clean single-file implementation of offline RL algorithms in JAX☆134Updated 2 months ago
- Study NeuralUCB and regret analysis for contextual bandit with neural decision☆92Updated 3 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆174Updated 2 years ago
- ☆44Updated 2 years ago
- Example implementation of Alpha Zero' s algotirhm on Jupyter notebook☆15Updated 5 years ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆75Updated last year
- Benchmarking RL generalization in an interpretable way.☆147Updated this week
- ☆120Updated last year
- [ICLR 2021] Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments.☆57Updated last year
- Paper Collection for Batch RL with brief introductions.☆84Updated 3 years ago
- Deep Hierarchical Planning from Pixels☆94Updated 2 years ago
- AI for google research football☆27Updated 4 years ago