microsoft/rl-offline-simulation

Data-driven offline simulation for online reinforcement learning: benchmark and baselines

☆31

Alternatives and similar repositories for rl-offline-simulation

Users who are interested in rl-offline-simulation are comparing it to the repositories listed below.

microsoft / aqa-tests
A fork of adoptium/aqa-tests with Msft specific changes
☆12 · Jun 29, 2026 · Updated 3 weeks ago
Miffyli / minecraft-bc-2020
Behavioural cloning solution to MineRL2020 competition
☆18 · Mar 6, 2021 · Updated 5 years ago
microsoft / HuRL
Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper
☆17 · Jan 3, 2022 · Updated 4 years ago
chscheller / minerl_agent
3rd placed submission to the NeurIPS MineRL competition 2019
☆10 · Mar 24, 2023 · Updated 3 years ago
usaito / icml2022-mips
(ICML2022) Off-Policy Evaluation for Large Action Spaces via Embeddings
☆22 · Jul 27, 2022 · Updated 3 years ago
microsoft / strategically_efficient_rl
More efficient exploration for reinforcement learning in two-player, zero-sum game
☆21 · Jul 30, 2024 · Updated last year
microsoft / mysqltoolsservice
MySQL Tools Service that provides MySQL Server data management capabilities.
☆22 · Jun 11, 2024 · Updated 2 years ago
Miffyli / rl-human-prior-tricks
Evaluating different engineering tricks that make RL work
☆15 · Jun 3, 2021 · Updated 5 years ago
microsoft / klite
[NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222
☆54 · Jun 12, 2023 · Updated 3 years ago
microsoft / Lightweight-Low-Resource-NMT
Official code for "Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Reso…
☆18 · Oct 9, 2025 · Updated 9 months ago
microsoft / logrl
Logarithmic Reinforcement Learning
☆28 · Apr 7, 2023 · Updated 3 years ago
microsoft / yardl
Tooling for streaming instrument data
☆35 · Jul 10, 2026 · Updated last week
microsoft / UniSumm
UNISUMM: Unified Few-shot Summarization with Multi-Task Pre-Training and Prefix-Tuning
☆61 · Jun 12, 2023 · Updated 3 years ago
MLD3 / OfflineRL_ModelSelection
[MLHC 2021] Model Selection for Offline RL: Practical Considerations for Healthcare Settings. https://arxiv.org/abs/2107.11003
☆11 · Oct 6, 2022 · Updated 3 years ago
microsoft / platformer-ml-game
Edutainment game teaching players concepts around machine learning
☆15 · Feb 18, 2020 · Updated 6 years ago
microsoft / dstoolkit-azoda
Azure Object Detection Accelerator. A repo for quickly and easily setting up a sample object detection project with training, labelling, …
☆20 · May 23, 2023 · Updated 3 years ago
sony / pyIEOE
☆32 · Feb 21, 2025 · Updated last year
microsoft / smart
Codebase for ICLR 2023 paper, "SMART: Self-supervised Multi-task pretrAining with contRol Transformers"
☆54 · Jan 26, 2024 · Updated 2 years ago
microsoft / autorl-research
The collection of the research works about Automatic Reinforcement Learning in Microsoft Research Asia.
☆63 · Jul 22, 2025 · Updated 11 months ago
MLD3 / OfflineRL_FactoredActions
[NeurIPS 2022] Leveraging Factored Action Spaces for Efficient Offline RL in Healthcare. https://arxiv.org/abs/2305.01738
☆11 · Nov 27, 2022 · Updated 3 years ago
dtak / POPCORN-POMDP
Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020)
☆11 · May 19, 2021 · Updated 5 years ago
microsoft / NTT
Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation [ICML 2021]
☆14 · Jul 17, 2025 · Updated last year
MLD3 / RL4BG
Public code release for "Deep Reinforcement Learning for Closed-Loop Blood Glucose Control" (Ian Fox et al.), MLHC 2020. https://arxiv.or…
☆13 · Feb 5, 2021 · Updated 5 years ago
microsoft / MAMBA
Imitation learning from multiple experts
☆13 · Aug 29, 2022 · Updated 3 years ago
microsoft / MacTok
MacTok is a research prototype for a one-time anonymous token scheme based on algebraic MACs.
☆23 · Apr 22, 2026 · Updated 2 months ago
clinicalml / trajectory-inspection
Code for "Trajectory Inspection: A Method for Iterative Clinician-Driven Design of Reinforcement Learning Studies"
☆16 · Oct 15, 2020 · Updated 5 years ago
microsoft / data-science-examples
Quick useful examples of data science & ML & big data
☆16 · Jun 12, 2023 · Updated 3 years ago
GilesLuo / ReassessDTR
☆14 · Jun 7, 2024 · Updated 2 years ago
microsoft / EMNLP2019-Split-And-Recombine
The code of EMNLP 2019 paper "A Split-and-Recombine Approach for Follow-up Query Analysis"
☆18 · Jul 20, 2023 · Updated 3 years ago
microsoft / BuildAnIntelligentBot
This is the sample of the Talk to My Bot implementation of a smart bot that can interact with other bots.
☆25 · Jun 27, 2023 · Updated 3 years ago
JonathanCrabbe / Symbolic-Pursuit
Github for the NIPS 2020 paper "Learning outside the black-box: at the pursuit of interpretable models"
☆14 · Sep 7, 2022 · Updated 3 years ago
microsoft / SandboxSecurityTools
Security testing tools for Windows sandboxing technologies
☆189 · May 5, 2025 · Updated last year
alistairewj / icu-model-transfer
Evaluating methods to improve model transfer for intensive care unit models
☆16 · Jul 6, 2023 · Updated 3 years ago
microsoft / openpai-runtime
Runtime for deep learning workload
☆21 · May 24, 2022 · Updated 4 years ago
Urban-Analytics / data-driven-car-following
Development of parametric, deep learning, and reinforcement learning agent-based model of car-following behaviour. The models aim to be d…
☆25 · Nov 18, 2019 · Updated 6 years ago
Matticusau / projectcard-autolabel
GitHub Action to automatically assign labels as the project card moves between columns of a project board
☆15 · Feb 9, 2021 · Updated 5 years ago
microsoft / responsible-ai-toolbox-genbit
A tool for gender bias identification in text. Part of Microsoft's Responsible AI toolbox.
☆51 · Aug 20, 2024 · Updated last year
microsoft / HyperdriveDeepLearning
Hyperparameter Tuning for Deep Learning
☆16 · Feb 5, 2020 · Updated 6 years ago
microsoft / PLOG
☆23 · Jun 7, 2023 · Updated 3 years ago