The collection of the research works about Automatic Reinforcement Learning in Microsoft Research Asia.
☆63Jul 22, 2025Updated 10 months ago
Alternatives and similar repositories for autorl-research
Users that are interested in autorl-research are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An implementation of effective policy ensemble.☆16Jul 5, 2023Updated 2 years ago
- [ICLR 2024 Spotlight] Code for the paper "Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making"☆12Apr 22, 2024Updated 2 years ago
- Anti exploration in offline reinforcement learning☆11May 17, 2021Updated 5 years ago
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆20Dec 30, 2022Updated 3 years ago
- Imitation learning from multiple experts☆13Aug 29, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Sandbox environment for generalizable agent research☆27Aug 19, 2022Updated 3 years ago
- A self-supervised learning approach based on extremely large masking☆31Dec 19, 2022Updated 3 years ago
- Fault-aware neural code rankers☆32Dec 9, 2022Updated 3 years ago
- ☆21Mar 19, 2024Updated 2 years ago
- Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…☆293Jun 10, 2022Updated 3 years ago
- [NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222☆53Jun 12, 2023Updated 2 years ago
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆28Jul 19, 2023Updated 2 years ago
- The first collection of surrogate benchmarks for Joint Architecture and Hyperparameter Search.☆15Mar 22, 2023Updated 3 years ago
- A lightweight reimplementation of Adversarially Trained Actor Critic☆20Mar 19, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Gallery for Industry AI demos☆18May 1, 2023Updated 3 years ago
- MySQL Tools Service that provides MySQL Server data management capabilities.☆22Jun 11, 2024Updated last year
- Code for the paper "Continual Model-Based Reinforcement Learning with Hypernetworks"☆15Jul 28, 2021Updated 4 years ago
- ☆46May 14, 2026Updated last week
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆87Nov 27, 2023Updated 2 years ago
- Official code for "Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Reso…☆18Oct 9, 2025Updated 7 months ago
- Boosting Natural Language Generation from Instructions with Meta-Learning☆11Dec 20, 2022Updated 3 years ago
- HPO and Architecture Benchmarking for RL: Dynamically, Reactive and Efficient☆30Mar 16, 2026Updated 2 months ago
- Official repo for Offline RL for Online RL☆18Oct 14, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆73Apr 26, 2026Updated last month
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)☆28Jun 3, 2023Updated 2 years ago
- PyTorch code accompanying the paper "Imitating Graph-Based Planning with Goal-Conditioned Policies" (ICLR 2023).☆21Mar 4, 2023Updated 3 years ago
- [COG24] - Official repository of "OfflineMania: A Benchmark Environment for Offline Reinforcement Learning in Racing Games"☆12Jul 15, 2024Updated last year
- ☆43May 25, 2023Updated 3 years ago
- AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers☆48Oct 21, 2022Updated 3 years ago
- [AAAI'2024] Critic-Guided Decision Transformer for Offline Reinforcement Learning☆18May 21, 2025Updated last year
- 遗传算法求解柔性车 间调度问题☆13Jun 3, 2023Updated 2 years ago
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.☆97May 21, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Synthetic Experience Replay☆112Apr 16, 2026Updated last month
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Jul 18, 2023Updated 2 years ago
- Paper List for In-context Learning 🌷☆19Jan 3, 2023Updated 3 years ago
- A learning-based scheme to capture external force/torque caused by payload of tethered-UAV system☆20May 27, 2025Updated last year
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆30Sep 16, 2022Updated 3 years ago
- Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond.☆50Dec 21, 2022Updated 3 years ago
- UNISUMM: Unified Few-shot Summarization with Multi-Task Pre-Training and Prefix-Tuning☆61Jun 12, 2023Updated 2 years ago