The collection of the research works about Automatic Reinforcement Learning in Microsoft Research Asia.
☆62Jul 22, 2025Updated 8 months ago
Alternatives and similar repositories for autorl-research
Users that are interested in autorl-research are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Anti exploration in offline reinforcement learning☆11May 17, 2021Updated 4 years ago
- MacTok is a research prototype for a one-time anonymous token scheme based on algebraic MACs.☆23Jan 20, 2023Updated 3 years ago
- Imitation learning from multiple experts☆13Aug 29, 2022Updated 3 years ago
- A self-supervised learning approach based on extremely large masking☆31Dec 19, 2022Updated 3 years ago
- Fault-aware neural code rankers☆32Dec 9, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆19Jun 25, 2023Updated 2 years ago
- ☆21Mar 19, 2024Updated 2 years ago
- Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…☆288Jun 10, 2022Updated 3 years ago
- Open source code for paper "Denoised MDPs: Learning World Models Better Than the World Itself"☆137Aug 15, 2023Updated 2 years ago
- [NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222☆53Jun 12, 2023Updated 2 years ago
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆27Jul 19, 2023Updated 2 years ago
- Data-driven offline simulation for online reinforcement learning: benchmark and baselines☆31Jul 25, 2024Updated last year
- The first collection of surrogate benchmarks for Joint Architecture and Hyperparameter Search.☆15Mar 22, 2023Updated 3 years ago
- A lightweight reimplementation of Adversarially Trained Actor Critic☆19Mar 19, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- MySQL Tools Service that provides MySQL Server data management capabilities.☆22Jun 11, 2024Updated last year
- ☆16Jun 12, 2023Updated 2 years ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆87Nov 27, 2023Updated 2 years ago
- Official code for "Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Reso…☆18Oct 9, 2025Updated 5 months ago
- Boosting Natural Language Generation from Instructions with Meta-Learning☆11Dec 20, 2022Updated 3 years ago
- Official repo for Offline RL for Online RL☆19Oct 14, 2023Updated 2 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆72Jan 18, 2024Updated 2 years ago
- Minimal example to apply Decision Transformer in Atari Pong☆15Feb 1, 2025Updated last year
- [COG24] - Official repository of "OfflineMania: A Benchmark Environment for Offline Reinforcement Learning in Racing Games"☆12Jul 15, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆43May 25, 2023Updated 2 years ago
- AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers☆48Oct 21, 2022Updated 3 years ago
- [AAAI'2024] Critic-Guided Decision Transformer for Offline Reinforcement Learning☆18May 21, 2025Updated 10 months ago
- Synthetic Experience Replay☆110May 27, 2024Updated last year
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.☆96May 21, 2023Updated 2 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Jul 18, 2023Updated 2 years ago
- Paper List for In-context Learning 🌷☆19Jan 3, 2023Updated 3 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆29Sep 16, 2022Updated 3 years ago
- [NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th…☆86Apr 4, 2025Updated 11 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond.☆50Dec 21, 2022Updated 3 years ago
- UNISUMM: Unified Few-shot Summarization with Multi-Task Pre-Training and Prefix-Tuning☆61Jun 12, 2023Updated 2 years ago
- Azure Object Detection Accelerator. A repo for quickly and easily setting up a sample object detection project with training, labelling, …☆20May 23, 2023Updated 2 years ago
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”☆26Mar 6, 2023Updated 3 years ago
- Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning☆28Sep 13, 2023Updated 2 years ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Oct 5, 2024Updated last year
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆18Nov 22, 2022Updated 3 years ago