Data-driven offline simulation for online reinforcement learning: benchmark and baselines
☆31Jul 25, 2024Updated last year
Alternatives and similar repositories for rl-offline-simulation
Users that are interested in rl-offline-simulation are comparing it to the libraries listed below
Sorting:
- A self-supervised learning approach based on extremely large masking☆31Dec 19, 2022Updated 3 years ago
- Behavioural cloning solution to MineRL2020 competition☆18Mar 6, 2021Updated 5 years ago
- CSS Textmate grammar for syntax highlighting☆28Feb 4, 2026Updated last month
- Gallery for Industry AI demos☆18May 1, 2023Updated 2 years ago
- Renee: End-to-end training of extreme classification models☆23Sep 29, 2023Updated 2 years ago
- ☆16Jun 12, 2023Updated 2 years ago
- ☆15Feb 21, 2023Updated 3 years ago
- (ICML2022) Off-Policy Evaluation for Large Action Spaces via Embeddings☆21Jul 27, 2022Updated 3 years ago
- Evaluating different engineering tricks that make RL work☆15Jun 3, 2021Updated 4 years ago
- UNISUMM: Unified Few-shot Summarization with Multi-Task Pre-Training and Prefix-Tuning☆61Jun 12, 2023Updated 2 years ago
- Codebase for ICLR 2023 paper, "SMART: Self-supervised Multi-task pretrAining with contRol Transformers"☆54Jan 26, 2024Updated 2 years ago
- The collection of the research works about Automatic Reinforcement Learning in Microsoft Research Asia.☆62Jul 22, 2025Updated 7 months ago
- ☆32Feb 21, 2025Updated last year
- [NeurIPS 2022] Leveraging Factored Action Spaces for Efficient Offline RL in Healthcare. https://arxiv.org/abs/2305.01738☆11Nov 27, 2022Updated 3 years ago
- aicreator for aidata☆14May 17, 2023Updated 2 years ago
- MacTok is a research prototype for a one-time anonymous token scheme based on algebraic MACs.☆23Jan 20, 2023Updated 3 years ago
- Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation [ICML 2021]☆15Jul 17, 2025Updated 8 months ago
- Code for "Trajectory Inspection: A Method for Iterative Clinician-Driven Design of Reinforcement Learning Studies"☆16Oct 15, 2020Updated 5 years ago
- Repository for the paper "An Adversarial Approach for the Robust Classification of Pneumonia from Chest Radiographs"☆19Jan 14, 2020Updated 6 years ago
- ☆13Jun 7, 2024Updated last year
- This is the sample of the Talk to My Bot implementation of a smart bot that can interact with other bots.☆26Jun 27, 2023Updated 2 years ago
- Runtime for deep learning workload☆21May 24, 2022Updated 3 years ago
- Edutainment game teaching players concepts around machine learning☆15Feb 18, 2020Updated 6 years ago
- A tool for gender bias identification in text. Part of Microsoft's Responsible AI toolbox.☆50Aug 20, 2024Updated last year
- A repository for managing workshop contents for learning Microsoft Azure's data analytics platform with a focus on Databricks SQL and Syn…☆21Jul 4, 2023Updated 2 years ago
- Used Flow, Ray/RLlib and OpenAI Gym to simulate and train autonomous vehicles/human drivers in SUMO (Simulation of Urban Mobility)☆25Dec 15, 2020Updated 5 years ago
- Hyperparameter Tuning for Deep Learning☆16Feb 5, 2020Updated 6 years ago
- Github for the NIPS 2020 paper "Learning outside the black-box: at the pursuit of interpretable models"☆14Sep 7, 2022Updated 3 years ago
- This is the implementation of the TextNAS algorithm proposed in the paper TextNAS: A Neural Architecture Search Space tailored for Text R…☆15Nov 28, 2022Updated 3 years ago
- Learn Mithril.js by seeing, reviewing, and running up-to-date code examples☆13Jun 17, 2023Updated 2 years ago
- Aquatic navigation environments for Gym☆20Sep 11, 2024Updated last year
- ☆13Aug 10, 2024Updated last year
- Python library for real-time control of a robotic manipulator☆21Feb 7, 2023Updated 3 years ago
- Repository for reproducibility of the CSV file project☆28Jan 20, 2022Updated 4 years ago
- Proof of concept code for poisoning code generation models.☆56Dec 6, 2023Updated 2 years ago
- Python repo for the XDK auto-generated code.☆22Feb 28, 2026Updated 3 weeks ago
- This repo is the official implementation of "Mask-based Latent Reconstruction for Reinforcement Learning" (NeurIPS 2022).☆29Jul 6, 2023Updated 2 years ago
- The code for paper entitled "Data-Driven Modulation Optimization with LMMSE Equalization for Reliability Enhancement in Underwater Acoust…☆19Oct 4, 2025Updated 5 months ago
- Code for Deep Structured Mixtures of Gaussian Processes (DSMGPs)☆11Jan 27, 2022Updated 4 years ago