Clean, extensible implementation of MACAW [ICML 2021]
☆12 · Dec 7, 2021 · Updated 4 years ago
Alternatives and similar repositories for macaw-min
Users that are interested in macaw-min are comparing it to the libraries listed below.
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021] ☆45 · Nov 30, 2022 · Updated 3 years ago
- Official implementation for the paper "Offline Meta RL - Identifiability Challenges and Effective Data Collection Strategies", NeurIPS 20… ☆31 · Nov 23, 2021 · Updated 4 years ago
- Code for FOCAL Paper Published at ICLR 2021 ☆55 · Dec 4, 2023 · Updated 2 years ago
- This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf… ☆11 · Oct 9, 2023 · Updated 2 years ago
- ☆16 · Aug 2, 2022 · Updated 3 years ago
- ☆21 · May 20, 2025 · Updated 11 months ago
- ☆15 · Apr 5, 2023 · Updated 3 years ago
- ☆14 · May 31, 2022 · Updated 3 years ago
- Model-Based Uncertainty in Value Functions (AISTATS2023) ☆16 · Feb 28, 2023 · Updated 3 years ago
- ☆18 · Apr 11, 2024 · Updated 2 years ago
- ☆16 · Jul 1, 2021 · Updated 4 years ago
- ☆35 · Mar 30, 2026 · Updated last month
- Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho… ☆21 · Feb 27, 2023 · Updated 3 years ago
- Learning bisimulation metrics for control, particularly suited to sparse reward settings ☆10 · Feb 28, 2023 · Updated 3 years ago
- E-MAML, and RL-MAML baseline implemented in Tensorflow v1 ☆17 · Dec 7, 2019 · Updated 6 years ago
- Implements the Messenger environment and EMMA model. ☆25 · Jun 14, 2023 · Updated 2 years ago
- ☆23 · Apr 2, 2024 · Updated 2 years ago
- ☆30 · Sep 5, 2024 · Updated last year
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL" ☆21 · Oct 6, 2021 · Updated 4 years ago
- Neural model for prediction of stress position in Russian words ☆13 · Jun 22, 2025 · Updated 10 months ago
- Official Code for the L4DC 2023 conference paper and ICLR 2023 NeSy-GeMs workshop paper. ☆15 · Oct 17, 2023 · Updated 2 years ago
- Bipedal Skills Benchmark for Reinforcement Learning ☆25 · Oct 27, 2022 · Updated 3 years ago
- [ICRA'25] H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps ☆12 · Apr 10, 2025 · Updated last year
- solving ml10 ☆26 · Nov 10, 2023 · Updated 2 years ago
- Taming MAML: efficient unbiased meta-reinforcement learning ☆30 · Sep 30, 2022 · Updated 3 years ago
- ☆36 · May 24, 2023 · Updated 2 years ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL) ☆26 · Jun 9, 2021 · Updated 4 years ago
- A Gymnasium-based Environment of the Abstraction and Reasoning Corpus (ARC) ☆71 · Aug 30, 2024 · Updated last year
- [ICML 2022] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning ☆40 · Aug 17, 2022 · Updated 3 years ago
- Command-Line Game of Hex in C++ ☆10 · Jan 10, 2026 · Updated 3 months ago
- Code for the papers Hypernetworks in Meta-Reinforcement Learning (Beck et al., 2022) and Recurrent Hypernetworks are Surprisingly Strong … ☆17 · Jul 31, 2024 · Updated last year
- Gamepad API Content Kit ☆14 · Jun 1, 2016 · Updated 9 years ago
- D3PE (Deep Data-Driven Policy Evaluation) aims to evaluate a large set of candidate policies from a fixed dataset in order to select the best ones. ☆11 · Jun 2, 2022 · Updated 3 years ago
- Pytorch Implementation of Learning Latent Dynamic Robust Representations for World Models ☆25 · May 11, 2024 · Updated last year
- The Codebase of <Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning> In NeurIPS 2024 ☆25 · Feb 20, 2025 · Updated last year
- Minimal AlphaZero in PyTorch, trained on Connect4 on a 6x6 board. ☆21 · Aug 12, 2022 · Updated 3 years ago
- Code related to my NIPS 2016 paper ☆10 · Dec 4, 2016 · Updated 9 years ago
- Code for 'Contrastive Multi-Document Question Generation' ☆11 · Oct 16, 2022 · Updated 3 years ago
- [MLHC 2021] Model Selection for Offline RL: Practical Considerations for Healthcare Settings. https://arxiv.org/abs/2107.11003 ☆11 · Oct 6, 2022 · Updated 3 years ago