A collection of research work on Automatic Reinforcement Learning from Microsoft Research Asia.
☆62 · Jul 22, 2025 · Updated 8 months ago
Alternatives and similar repositories for autorl-research
Users who are interested in autorl-research are comparing it to the libraries listed below.
- An implementation of effective policy ensemble. ☆16 · Jul 5, 2023 · Updated 2 years ago
- [ICLR 2024 Spotlight] Code for the paper "Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making" ☆12 · Apr 22, 2024 · Updated last year
- Anti exploration in offline reinforcement learning ☆11 · May 17, 2021 · Updated 4 years ago
- MacTok is a research prototype for a one-time anonymous token scheme based on algebraic MACs. ☆23 · Jan 20, 2023 · Updated 3 years ago
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning ☆20 · Dec 30, 2022 · Updated 3 years ago
- Imitation learning from multiple experts ☆13 · Aug 29, 2022 · Updated 3 years ago
- Sandbox environment for generalizable agent research ☆27 · Aug 19, 2022 · Updated 3 years ago
- A self-supervised learning approach based on extremely large masking ☆31 · Dec 19, 2022 · Updated 3 years ago
- Fault-aware neural code rankers ☆32 · Dec 9, 2022 · Updated 3 years ago
- ☆19 · Jun 25, 2023 · Updated 2 years ago
- ☆21 · Mar 19, 2024 · Updated 2 years ago
- Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open… ☆289 · Jun 10, 2022 · Updated 3 years ago
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization ☆27 · Jul 19, 2023 · Updated 2 years ago
- The first collection of surrogate benchmarks for Joint Architecture and Hyperparameter Search. ☆15 · Mar 22, 2023 · Updated 3 years ago
- A fork of adoptium/aqa-tests with Msft specific changes ☆12 · Updated this week
- A lightweight reimplementation of Adversarially Trained Actor Critic ☆20 · Mar 19, 2026 · Updated 3 weeks ago
- Gallery for Industry AI demos ☆18 · May 1, 2023 · Updated 2 years ago
- MySQL Tools Service that provides MySQL Server data management capabilities. ☆22 · Jun 11, 2024 · Updated last year
- ☆16 · Jun 12, 2023 · Updated 2 years ago
- A Decision Transformer for solving optimal EV charging problems using offline data. ☆18 · Jan 19, 2026 · Updated 2 months ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e… ☆87 · Nov 27, 2023 · Updated 2 years ago
- Official code for "Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Reso… ☆18 · Oct 9, 2025 · Updated 6 months ago
- Boosting Natural Language Generation from Instructions with Meta-Learning ☆11 · Dec 20, 2022 · Updated 3 years ago
- HPO and Architecture Benchmarking for RL: Dynamically, Reactive and Efficient ☆30 · Mar 16, 2026 · Updated last month
- Official repo for Offline RL for Online RL ☆19 · Oct 14, 2023 · Updated 2 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral) ☆72 · Jan 18, 2024 · Updated 2 years ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023) ☆28 · Jun 3, 2023 · Updated 2 years ago
- Minimal example to apply Decision Transformer in Atari Pong ☆15 · Feb 1, 2025 · Updated last year
- [COG24] - Official repository of "OfflineMania: A Benchmark Environment for Offline Reinforcement Learning in Racing Games" ☆12 · Jul 15, 2024 · Updated last year
- ☆43 · May 25, 2023 · Updated 2 years ago
- AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers ☆48 · Oct 21, 2022 · Updated 3 years ago
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations. ☆96 · May 21, 2023 · Updated 2 years ago
- Synthetic Experience Replay ☆111 · May 27, 2024 · Updated last year
- Implementation of Tactical Optimistic and Pessimistic value estimation ☆25 · Jul 18, 2023 · Updated 2 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT) ☆29 · Sep 16, 2022 · Updated 3 years ago
- [NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th… ☆86 · Apr 4, 2025 · Updated last year
- Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond. ☆50 · Dec 21, 2022 · Updated 3 years ago
- UNISUMM: Unified Few-shot Summarization with Multi-Task Pre-Training and Prefix-Tuning ☆61 · Jun 12, 2023 · Updated 2 years ago
- Azure Object Detection Accelerator. A repo for quickly and easily setting up a sample object detection project with training, labelling, … ☆20 · May 23, 2023 · Updated 2 years ago