eric-mitchell/macaw

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/eric-mitchell/macaw)

eric-mitchell / macaw

Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]

☆45

Alternatives and similar repositories for macaw

Users that are interested in macaw are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

eric-mitchell / macaw-min
View on GitHub
Clean, extensible implementation of MACAW [ICML 2021]
☆12Dec 7, 2021Updated 4 years ago
ethanluoyc / optimal_transport_reward
View on GitHub
☆18Apr 11, 2024Updated 2 years ago
LanqingLi1993 / FOCAL-ICLR
View on GitHub
Code for FOCAL Paper Published at ICLR 2021
☆55Dec 4, 2023Updated 2 years ago
Rondorf / BOReL
View on GitHub
Official implementation for the paper "Offline Meta RL - Identifiability Challenges and Effective Data Collection Strategies", NeurIPS 20…
☆31Nov 23, 2021Updated 4 years ago
suyoung-lee / LDM
View on GitHub
Latent Dynamics Mixture, NeurIPS 2021
☆18Oct 25, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
russellmendonca / mier_public
View on GitHub
☆13Mar 16, 2023Updated 3 years ago
mxu34 / prompt-dt
View on GitHub
Official code repository for Prompt-DT.
☆123Aug 3, 2022Updated 3 years ago
ezliu / dream
View on GitHub
Decoupled Reward-free ExplorAtion and Execution for Meta-reinforcement learning
☆92Feb 13, 2023Updated 3 years ago
Mehooz / BIRD_code
View on GitHub
Code for paper "Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning".
☆14May 23, 2021Updated 5 years ago
yiqiwang8177 / Official-codebase-for-Decision-Transducer
View on GitHub
This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf…
☆11Oct 9, 2023Updated 2 years ago
PKU-RL / CORRO
View on GitHub
[ICML 2022] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning
☆40Aug 17, 2022Updated 3 years ago
tianjunz / MADE
View on GitHub
☆19Jul 18, 2021Updated 5 years ago
haoliuhl / taming-maml
View on GitHub
Taming MAML: efficient unbiased meta-reinforcement learning
☆30Sep 30, 2022Updated 3 years ago
shlee94 / Off2OnRL
View on GitHub
☆61Feb 3, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
zzxslp / XL-VLN
View on GitHub
Dataset for Bilingual VLN
☆11Dec 5, 2020Updated 5 years ago
Jielin-Qiu / MMWatermark-Robustness
View on GitHub
Evaluating Durability: Benchmark Insights into Multimodal Watermarking
☆12Jun 7, 2024Updated 2 years ago
vacancy / PDSketch-Alpha-Release
View on GitHub
☆17Nov 1, 2023Updated 2 years ago
DesikRengarajan / EMRLD
View on GitHub
[NeurIPS 2022] Code for Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments
☆12Sep 28, 2022Updated 3 years ago
philipjball / ReadyPolicyOne
View on GitHub
🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)
☆18Jul 6, 2023Updated 3 years ago
tonyzhaozh / meld
View on GitHub
MELD: Meta-Reinforcement Learning from Images via Latent State Models https://arxiv.org/abs/2010.13957
☆67Apr 30, 2021Updated 5 years ago
keynans / HypeRL
View on GitHub
Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)
☆26Jun 9, 2021Updated 5 years ago
lmzintgraf / varibad
View on GitHub
Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)
☆203Mar 15, 2023Updated 3 years ago
MLD3 / OfflineRL_ModelSelection
View on GitHub
[MLHC 2021] Model Selection for Offline RL: Practical Considerations for Healthcare Settings. https://arxiv.org/abs/2107.11003
☆11Oct 6, 2022Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
jdchang1 / milo
View on GitHub
☆16Oct 5, 2021Updated 4 years ago
mbchang / crl
View on GitHub
Automatically Composing Representation Transformations as a Means for Generalization
☆24Jun 3, 2019Updated 7 years ago
ryanxhr / DWBC
View on GitHub
[ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"
☆35Jan 5, 2023Updated 3 years ago
pcchenxi / LAPO-offlienRL
View on GitHub
☆16Apr 14, 2026Updated 3 months ago
namsan96 / SiMPL
View on GitHub
☆50Jul 30, 2023Updated 2 years ago
avillaflor / SPLT-transformer
View on GitHub
☆18Jul 10, 2022Updated 4 years ago
mhw32 / meta-inference-public
View on GitHub
A PyTorch implementation of "Meta-Amortized Variational Inference and Learning" (https://arxiv.org/abs/1902.01950)
☆14Mar 31, 2020Updated 6 years ago
rll-research / finetune-vs-metarl
View on GitHub
☆14May 31, 2022Updated 4 years ago
takuseno / d3rlpy-benchmarks
View on GitHub
Benchmark data for d3rlpy
☆22Nov 28, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
rr-learning / trifinger_rl_datasets
View on GitHub
A python package for loading robotics datasets which were recorded on the TriFinger platform. Also contains simulated gym environments th…
☆17Jan 17, 2024Updated 2 years ago
vuoristo / MMAML-Regression
View on GitHub
☆22Mar 7, 2021Updated 5 years ago
Cloud0723 / Offline-MLIRL
View on GitHub
☆22Dec 18, 2023Updated 2 years ago
PKU-RL / PTGM
View on GitHub
[ICLR 2024 oral] Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning
☆30Mar 1, 2024Updated 2 years ago
facebookresearch / ExPLORe
View on GitHub
This is code to accompany the paper "Accelerating Exploration with Unlabeled Prior Data".
☆26Dec 5, 2023Updated 2 years ago
zhihanyang2022 / alpha-zero
View on GitHub
Minimal AlphaZero in PyTorch, trained on Connect4 on a 6x6 board.
☆21Aug 12, 2022Updated 3 years ago
Leot6 / AMoD2
View on GitHub
A high-capacity on-demand ride-sharing simulator, with three representative vehicle dispatch algorithms implemented.
☆17Jan 14, 2022Updated 4 years ago