A collection of research works on Automatic Reinforcement Learning at Microsoft Research Asia.
☆62Jul 22, 2025Updated 7 months ago
Alternatives and similar repositories for autorl-research
Users who are interested in autorl-research are comparing it to the libraries listed below.
- ☆20Mar 19, 2024Updated last year
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆20Dec 30, 2022Updated 3 years ago
- MacTok is a research prototype for a one-time anonymous token scheme based on algebraic MACs.☆23Jan 20, 2023Updated 3 years ago
- ☆19Jun 25, 2023Updated 2 years ago
- A self-supervised learning approach based on extremely large masking☆31Dec 19, 2022Updated 3 years ago
- Anti exploration in offline reinforcement learning☆11May 17, 2021Updated 4 years ago
- Boosting Natural Language Generation from Instructions with Meta-Learning☆11Dec 20, 2022Updated 3 years ago
- Gallery for Industry AI demos☆18May 1, 2023Updated 2 years ago
- [ICLR 2024 Spotlight] Code for the paper "Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making"☆12Apr 22, 2024Updated last year
- Imitation learning from multiple experts☆13Aug 29, 2022Updated 3 years ago
- [NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222☆53Jun 12, 2023Updated 2 years ago
- Data-driven offline simulation for online reinforcement learning: benchmark and baselines☆31Jul 25, 2024Updated last year
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆27Jul 19, 2023Updated 2 years ago
- ☆16Jun 12, 2023Updated 2 years ago
- Official code for "Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Reso…☆18Oct 9, 2025Updated 4 months ago
- Analytical solutions for logistic regression and single-layer softmax☆12Jul 29, 2021Updated 4 years ago
- ☆15Feb 21, 2023Updated 3 years ago
- MySQL Tools Service that provides MySQL Server data management capabilities.☆22Jun 11, 2024Updated last year
- Open source code for paper "Denoised MDPs: Learning World Models Better Than the World Itself"☆137Aug 15, 2023Updated 2 years ago
- A probabilistic programming language, based on Church☆17Oct 11, 2017Updated 8 years ago
- Zero-Shot Summarization with GPT-3☆17Sep 11, 2023Updated 2 years ago
- Azure Object Detection Accelerator. A repo for quickly and easily setting up a sample object detection project with training, labelling, …☆20May 23, 2023Updated 2 years ago
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021☆15Mar 9, 2021Updated 4 years ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Oct 5, 2024Updated last year
- The first collection of surrogate benchmarks for Joint Architecture and Hyperparameter Search.☆15Mar 22, 2023Updated 2 years ago
- Flow RL is a high-performance RL library with flow and diffusion models.☆28Updated this week
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆86Nov 27, 2023Updated 2 years ago
- Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…☆288Jun 10, 2022Updated 3 years ago
- A repository for managing workshop contents for learning Microsoft Azure's data analytics platform with a focus on Databricks SQL and Syn…☆21Jul 4, 2023Updated 2 years ago
- ☆18Feb 7, 2026Updated last month
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…☆18Nov 8, 2024Updated last year
- Official repo for Offline RL for Online RL☆19Oct 14, 2023Updated 2 years ago
- Paper List for In-context Learning 🌷☆20Jan 3, 2023Updated 3 years ago
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆18Nov 22, 2022Updated 3 years ago
- [NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th…☆86Apr 4, 2025Updated 11 months ago
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆21Mar 24, 2025Updated 11 months ago
- Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond.☆49Dec 21, 2022Updated 3 years ago
- HPO and Architecture Benchmarking for RL: Dynamically, Reactive and Efficient☆27Jan 14, 2026Updated last month
- PyTorch code accompanying the paper "Imitating Graph-Based Planning with Goal-Conditioned Policies" (ICLR 2023).☆20Mar 4, 2023Updated 3 years ago