Data-driven offline simulation for online reinforcement learning: benchmark and baselines
☆31Jul 25, 2024Updated last year
Alternatives and similar repositories for rl-offline-simulation
Users that are interested in rl-offline-simulation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A self-supervised learning approach based on extremely large masking☆31Dec 19, 2022Updated 3 years ago
- Fault-aware neural code rankers☆32Dec 9, 2022Updated 3 years ago
- Behavioural cloning solution to MineRL2020 competition☆18Mar 6, 2021Updated 5 years ago
- Headway - Selenium Maven TestNG POM Data Driven Framework☆18Jul 2, 2025Updated 9 months ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Renee: End-to-end training of extreme classification models☆23Sep 29, 2023Updated 2 years ago
- ☆16Jun 12, 2023Updated 2 years ago
- More efficient exploration for reinforcement learning in two-player, zero-sum game☆21Jul 30, 2024Updated last year
- ☆15Feb 21, 2023Updated 3 years ago
- (ICML2022) Off-Policy Evaluation for Large Action Spaces via Embeddings☆22Jul 27, 2022Updated 3 years ago
- [NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222☆53Jun 12, 2023Updated 2 years ago
- Evaluating different engineering tricks that make RL work☆15Jun 3, 2021Updated 4 years ago
- Logarithmic Reinforcement Learning☆28Apr 7, 2023Updated 3 years ago
- UNISUMM: Unified Few-shot Summarization with Multi-Task Pre-Training and Prefix-Tuning☆61Jun 12, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [MLHC 2021] Model Selection for Offline RL: Practical Considerations for Healthcare Settings. https://arxiv.org/abs/2107.11003☆11Oct 6, 2022Updated 3 years ago
- Codebase for ICLR 2023 paper, "SMART: Self-supervised Multi-task pretrAining with contRol Transformers"☆54Jan 26, 2024Updated 2 years ago
- ☆32Feb 21, 2025Updated last year
- [NeurIPS 2022] Leveraging Factored Action Spaces for Efficient Offline RL in Healthcare. https://arxiv.org/abs/2305.01738☆11Nov 27, 2022Updated 3 years ago
- aicreator for aidata☆14May 17, 2023Updated 2 years ago
- Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020)☆11May 19, 2021Updated 4 years ago
- MacTok is a research prototype for a one-time anonymous token scheme based on algebraic MACs.☆23Jan 20, 2023Updated 3 years ago
- Imitation learning from multiple experts☆13Aug 29, 2022Updated 3 years ago
- Lightweight library that converts a HTML webpage to JSON data using a template defined in JSON.☆23Jun 2, 2025Updated 10 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- The code of EMNLP 2019 paper "A Split-and-Recombine Approach for Follow-up Query Analysis"☆18Jul 20, 2023Updated 2 years ago
- Evaluating methods to improve model transfer for intensive care unit models☆16Jul 6, 2023Updated 2 years ago
- Edutainment game teaching players concepts around machine learning☆15Feb 18, 2020Updated 6 years ago
- ☆23Jun 7, 2023Updated 2 years ago
- Development of parametric, deep learning, and reinforcement learning agent-based model of car-following behaviour. The models aim to be d…☆25Nov 18, 2019Updated 6 years ago
- A repository for managing workshop contents for learning Microsoft Azure's data analytics platform with a focus on Databricks SQL and Syn…☆21Jul 4, 2023Updated 2 years ago
- The GitHub browserslist config☆33Nov 21, 2024Updated last year
- AI Assistant for Building Reliable, High-performing and Fair Multilingual NLP Systems☆48Aug 19, 2022Updated 3 years ago
- This is the implementation of the TextNAS algorithm proposed in the paper TextNAS: A Neural Architecture Search Space tailored for Text R…☆15Nov 28, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Aquatic navigation environments for Gym☆20Sep 11, 2024Updated last year
- Quick useful examples of data science & ML & big data☆16Jun 12, 2023Updated 2 years ago
- Python library for real-time control of a robotic manipulator☆21Feb 7, 2023Updated 3 years ago
- Samples for use with MLOps☆13Jul 6, 2023Updated 2 years ago
- This repo is the official implementation of "Mask-based Latent Reconstruction for Reinforcement Learning" (NeurIPS 2022).☆30Jul 6, 2023Updated 2 years ago
- finding set bits in large bitmaps☆15Nov 30, 2015Updated 10 years ago
- python c-module for siphash☆19Updated this week