microsoft/autorl-research

A collection of research work on Automatic Reinforcement Learning from Microsoft Research Asia.

☆62

Alternatives and similar repositories for autorl-research

Users interested in autorl-research are comparing it to the repositories listed below.

Lifelong-ML / offline-compositional-rl-datasets
☆20 · Mar 19, 2024 · Updated last year
danielshin1 / oprl
Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning
☆20 · Dec 30, 2022 · Updated 3 years ago
microsoft / MacTok
MacTok is a research prototype for a one-time anonymous token scheme based on algebraic MACs.
☆23 · Jan 20, 2023 · Updated 3 years ago
hwang-ua / inac_pytorch
☆19 · Jun 25, 2023 · Updated 2 years ago
microsoft / ExtreMA
A self-supervised learning approach based on extremely large masking
☆31 · Dec 19, 2022 · Updated 3 years ago
shidilrzf / Anti-exploration-RL
Anti exploration in offline reinforcement learning
☆11 · May 17, 2021 · Updated 4 years ago
microsoft / NLG_Instructions_MetaLearning
Boosting Natural Language Generation from Instructions with Meta-Learning
☆11 · Dec 20, 2022 · Updated 3 years ago
microsoft / dstoolkit-ai-ux
Gallery for Industry AI demos
☆18 · May 1, 2023 · Updated 2 years ago
beanie00 / Decision-ConvFormer
[ICLR 2024 Spotlight] Code for the paper "Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making"
☆12 · Apr 22, 2024 · Updated last year
microsoft / MAMBA
Imitation learning from multiple experts
☆13 · Aug 29, 2022 · Updated 3 years ago
microsoft / klite
[NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222
☆53 · Jun 12, 2023 · Updated 2 years ago
microsoft / rl-offline-simulation
Data-driven offline simulation for online reinforcement learning: benchmark and baselines
☆31 · Jul 25, 2024 · Updated last year
haosulab / RPG
Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization
☆27 · Jul 19, 2023 · Updated 2 years ago
microsoft / SCGLab
☆16 · Jun 12, 2023 · Updated 2 years ago
microsoft / Lightweight-Low-Resource-NMT
Official code for "Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Reso…
☆18 · Oct 9, 2025 · Updated 4 months ago
bojone / analytical-classification
Analytical solutions for logistic regression and single-layer softmax
☆12 · Jul 29, 2021 · Updated 4 years ago
microsoft / 2023iotlevelup
☆15 · Feb 21, 2023 · Updated 3 years ago
microsoft / mysqltoolsservice
MySQL Tools Service that provides MySQL Server data management capabilities.
☆22 · Jun 11, 2024 · Updated last year
facebookresearch / denoised_mdp
Open source code for paper "Denoised MDPs: Learning World Models Better Than the World Itself"
☆137 · Aug 15, 2023 · Updated 2 years ago
WuTheFWasThat / PyChurch
A probabilistic programming language, based on Church
☆17 · Oct 11, 2017 · Updated 8 years ago
testzer0 / ZS-Summ-GPT3
Zero-Shot Summarization with GPT-3
☆17 · Sep 11, 2023 · Updated 2 years ago
microsoft / dstoolkit-azoda
Azure Object Detection Accelerator. A repo for quickly and easily setting up a sample object detection project with training, labelling, …
☆20 · May 23, 2023 · Updated 2 years ago
Stanford-ILIAD / Conventions-ModularPolicy
PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021
☆15 · Mar 9, 2021 · Updated 4 years ago
mansicer / Q-Adapter
Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"
☆18 · Oct 5, 2024 · Updated last year
automl / jahs_bench_201
The first collection of surrogate benchmarks for Joint Architecture and Hyperparameter Search.
☆15 · Mar 22, 2023 · Updated 2 years ago
typoverflow / flow-rl
Flow RL is a high-performance RL library with flow and diffusion models.
☆28 · Updated this week
facebookresearch / how-to-autorl
Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…
☆86 · Nov 27, 2023 · Updated 2 years ago
nikhilbarhate99 / min-decision-transformer
Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…
☆288 · Jun 10, 2022 · Updated 3 years ago
microsoft / azure-databricks-sql-workshop-ja
A repository for managing workshop contents for learning Microsoft Azure's data analytics platform with a focus on Databricks SQL and Syn…
☆21 · Jul 4, 2023 · Updated 2 years ago
sebbyjp / robo_transformers
☆18 · Feb 7, 2026 · Updated last month
LAMDA-RL / PRDC
Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…
☆18 · Nov 8, 2024 · Updated last year
MaxSobolMark / OOO
Official repo for Offline RL for Online RL
☆19 · Oct 14, 2023 · Updated 2 years ago
donglixp / ICL_PaperList
Paper List for In-context Learning 🌷
☆20 · Jan 3, 2023 · Updated 3 years ago
clvrai / create
CREATE Environment for long-horizon physics-puzzle tasks with diverse tools
☆18 · Nov 22, 2022 · Updated 3 years ago
mazpie / genrl
[NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th…
☆86 · Apr 4, 2025 · Updated 11 months ago
typoverflow / WiseRL
PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms
☆21 · Mar 24, 2025 · Updated 11 months ago
hammer-wang / Awesome-Transformers-for-Sequential-Decision-Making
Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond.
☆49 · Dec 21, 2022 · Updated 3 years ago
automl / arlbench
HPO and Architecture Benchmarking for RL: Dynamically, Reactive and Efficient
☆27 · Jan 14, 2026 · Updated last month
junsu-kim97 / PIG
PyTorch code accompanying the paper "Imitating Graph-Based Planning with Goal-Conditioned Policies" (ICLR 2023).
☆20 · Mar 4, 2023 · Updated 3 years ago