eric-mitchell/macaw-min

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/eric-mitchell/macaw-min)

eric-mitchell / macaw-min

Clean, extensible implementation of MACAW [ICML 2021]

☆12

Alternatives and similar repositories for macaw-min

Users that are interested in macaw-min are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

eric-mitchell / macaw
View on GitHub
Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]
☆45Nov 30, 2022Updated 3 years ago
Rondorf / BOReL
View on GitHub
Official implementation for the paper "Offline Meta RL - Identifiability Challenges and Effective Data Collection Strategies", NeurIPS 20…
☆31Nov 23, 2021Updated 4 years ago
LanqingLi1993 / FOCAL-ICLR
View on GitHub
Code for FOCAL Paper Published at ICLR 2021
☆55Dec 4, 2023Updated 2 years ago
raymondchua / simple_successor_features
View on GitHub
Simple Successor Features
☆20Jul 15, 2025Updated last year
lmzintgraf / hyperx
View on GitHub
☆16Aug 2, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
rll-research / finetune-vs-metarl
View on GitHub
☆14May 31, 2022Updated 4 years ago
hmishfaq / LSAC
View on GitHub
The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025
☆22May 28, 2025Updated last year
boschresearch / ube-mbrl
View on GitHub
Model-Based Uncertainty in Value Functions (AISTATS2023)
☆16Feb 28, 2023Updated 3 years ago
ethanluoyc / optimal_transport_reward
View on GitHub
☆18Apr 11, 2024Updated 2 years ago
philipjball / ReadyPolicyOne
View on GitHub
🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)
☆18Jul 6, 2023Updated 3 years ago
MouseHu / GEM
View on GitHub
☆16Jul 1, 2021Updated 5 years ago
tinkoff-ai / lb-sac
View on GitHub
Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…
☆21Feb 27, 2023Updated 3 years ago
ahjwang / messenger-emma
View on GitHub
Implements the Messenger environment and EMMA model.
☆25Jun 14, 2023Updated 3 years ago
geyang / e-maml
View on GitHub
E-MAML, and RL-MAML baseline implemented in Tensorflow v1
☆17Dec 7, 2019Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
metekemertas / RobustBisimulation
View on GitHub
Learning bisimulation metrics for control, particularly suited to sparse reward settings
☆11Feb 28, 2023Updated 3 years ago
si0wang / COPlanner
View on GitHub
☆23Apr 2, 2024Updated 2 years ago
tanchongmin / ARC-Challenge
View on GitHub
☆30Sep 5, 2024Updated last year
Koziev / StressModel
View on GitHub
Neural model for prediction of stress position in Russian words
☆13Jun 22, 2025Updated last year
ben-eysenbach / mnm
View on GitHub
Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"
☆21Oct 6, 2021Updated 4 years ago
facebookresearch / bipedal-skills
View on GitHub
Bipedal Skills Benchmark for Reinforcement Learning
☆26Oct 27, 2022Updated 3 years ago
arashsm79 / eddiebot-ros
View on GitHub
ROS2 packages for Parallax Eddie robot along with simulations using Gazebo (formerly ignition gazebo)
☆16Feb 28, 2024Updated 2 years ago
t6-thu / H2Oplus
View on GitHub
[ICRA'25] H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps
☆13Apr 10, 2025Updated last year
sail-sg / ContinualBench
View on GitHub
☆25May 20, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
suyoung-lee / SDVT
View on GitHub
solving ml10
☆26Nov 10, 2023Updated 2 years ago
haoliuhl / taming-maml
View on GitHub
Taming MAML: efficient unbiased meta-reinforcement learning
☆30Sep 30, 2022Updated 3 years ago
keynans / HypeRL
View on GitHub
Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)
☆26Jun 9, 2021Updated 5 years ago
ldcq / ldcq
View on GitHub
☆35May 24, 2023Updated 3 years ago
tdmpc2 / tdmpc2-eval
View on GitHub
Evaluation of TD-MPC2.
☆21Jan 21, 2024Updated 2 years ago
ConfeitoHS / arcle
View on GitHub
A Gymnasium-based Environment of the Abstraction and Reasoning Corpus (ARC)
☆73Aug 30, 2024Updated last year
PKU-RL / CORRO
View on GitHub
[ICML 2022] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning
☆40Aug 17, 2022Updated 3 years ago
end3r / Gamepad-API-Content-Kit
View on GitHub
Gamepad API Content Kit
☆14Jun 1, 2016Updated 10 years ago
RoozbehRazavi / BIMRL
View on GitHub
Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022)
☆10Dec 1, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
zhihanyang2022 / alpha-zero
View on GitHub
Minimal AlphaZero in PyTorch, trained on Connect4 on a 6x6 board.
☆21Aug 12, 2022Updated 3 years ago
polixir / d3pe
View on GitHub
D3PE (Deep Data-Driven Policy Evaluation) aims to evaluation a large set of candidate policies from a fixed dataset to select best ones.
☆10Jun 2, 2022Updated 4 years ago
bit1029public / HRSSM
View on GitHub
Pytorch Implementation of Learning Latent Dynamic Robust Representations for World Models
☆25May 11, 2024Updated 2 years ago
pgermain / PAC-Bayesian-Theory-Meets-Bayesian-Inference
View on GitHub
Code to related to my NIPS 2016 paper
☆10Dec 4, 2016Updated 9 years ago
woonsangcho / contrast_qgen
View on GitHub
Code for 'Contrastive Multi-Document Question Generation'
☆11Oct 16, 2022Updated 3 years ago
Open-X-Humanoid / Robo-ValueRL
View on GitHub
☆17Jul 13, 2026Updated last week
facebookresearch / reward-estimator-iclr
View on GitHub
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
☆11May 8, 2018Updated 8 years ago