Data-driven offline simulation for online reinforcement learning: benchmark and baselines
☆31Jul 25, 2024Updated last year
Alternatives and similar repositories for rl-offline-simulation
Users that are interested in rl-offline-simulation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A fork of adoptium/aqa-tests with Msft specific changes☆12Apr 11, 2026Updated 2 weeks ago
- A self-supervised learning approach based on extremely large masking☆31Dec 19, 2022Updated 3 years ago
- Fault-aware neural code rankers☆32Dec 9, 2022Updated 3 years ago
- Behavioural cloning solution to MineRL2020 competition☆18Mar 6, 2021Updated 5 years ago
- Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper☆17Jan 3, 2022Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 3 years ago
- MySQL Tools Service that provides MySQL Server data management capabilities.☆22Jun 11, 2024Updated last year
- Official code for "Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Reso…☆18Oct 9, 2025Updated 6 months ago
- (ICML2022) Off-Policy Evaluation for Large Action Spaces via Embeddings☆22Jul 27, 2022Updated 3 years ago
- [NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222☆53Jun 12, 2023Updated 2 years ago
- Evaluating different engineering tricks that make RL work☆15Jun 3, 2021Updated 4 years ago
- Logarithmic Reinforcement Learning☆28Apr 7, 2023Updated 3 years ago
- Tooling for streaming instrument data☆34Apr 22, 2026Updated last week
- UNISUMM: Unified Few-shot Summarization with Multi-Task Pre-Training and Prefix-Tuning☆61Jun 12, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [MLHC 2021] Model Selection for Offline RL: Practical Considerations for Healthcare Settings. https://arxiv.org/abs/2107.11003☆11Oct 6, 2022Updated 3 years ago
- Azure Object Detection Accelerator. A repo for quickly and easily setting up a sample object detection project with training, labelling, …☆20May 23, 2023Updated 2 years ago
- Codebase for ICLR 2023 paper, "SMART: Self-supervised Multi-task pretrAining with contRol Transformers"☆54Jan 26, 2024Updated 2 years ago
- [NeurIPS 2022] Leveraging Factored Action Spaces for Efficient Offline RL in Healthcare. https://arxiv.org/abs/2305.01738☆11Nov 27, 2022Updated 3 years ago
- MacTok is a research prototype for a one-time anonymous token scheme based on algebraic MACs.☆23Apr 22, 2026Updated last week
- Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation [ICML 2021]☆15Jul 17, 2025Updated 9 months ago
- Lightweight library that converts a HTML webpage to JSON data using a template defined in JSON.☆23Jun 2, 2025Updated 10 months ago
- This repository explores a variety of data visualization techniques, with a particular focus on applications in the hospitality domain. I…☆42Oct 16, 2025Updated 6 months ago
- ☆14Jun 7, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The code of EMNLP 2019 paper "A Split-and-Recombine Approach for Follow-up Query Analysis"☆18Jul 20, 2023Updated 2 years ago
- Evaluating methods to improve model transfer for intensive care unit models☆16Jul 6, 2023Updated 2 years ago
- Edutainment game teaching players concepts around machine learning☆15Feb 18, 2020Updated 6 years ago
- DeFacto - Demonstrations and Feedback for improving factual consistency of text summarization☆30Dec 19, 2022Updated 3 years ago
- A tool for gender bias identification in text. Part of Microsoft's Responsible AI toolbox.☆50Aug 20, 2024Updated last year
- ☆23Jun 7, 2023Updated 2 years ago
- Development of parametric, deep learning, and reinforcement learning agent-based model of car-following behaviour. The models aim to be d…☆25Nov 18, 2019Updated 6 years ago
- A repository for managing workshop contents for learning Microsoft Azure's data analytics platform with a focus on Databricks SQL and Syn…☆21Jul 4, 2023Updated 2 years ago
- Used Flow, Ray/RLlib and OpenAI Gym to simulate and train autonomous vehicles/human drivers in SUMO (Simulation of Urban Mobility)☆25Dec 15, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- AI Assistant for Building Reliable, High-performing and Fair Multilingual NLP Systems☆48Aug 19, 2022Updated 3 years ago
- Github for the NIPS 2020 paper "Learning outside the black-box: at the pursuit of interpretable models"☆14Sep 7, 2022Updated 3 years ago
- This is the implementation of the TextNAS algorithm proposed in the paper TextNAS: A Neural Architecture Search Space tailored for Text R…☆15Nov 28, 2022Updated 3 years ago
- Aquatic navigation environments for Gym☆20Sep 11, 2024Updated last year
- Learn Mithril.js by seeing, reviewing, and running up-to-date code examples☆13Jun 17, 2023Updated 2 years ago
- Quick useful examples of data science & ML & big data☆16Jun 12, 2023Updated 2 years ago
- Python library for real-time control of a robotic manipulator☆21Feb 7, 2023Updated 3 years ago