luckeciano / transformers-metarl

Transformers are Meta-Reinforcement Learners - International Conference on Machine Learning (ICML) 2022

☆63

Alternatives and similar repositories for transformers-metarl

Users who are interested in transformers-metarl are comparing it to the repositories listed below.

Dragon-Zhuang / BPPO
Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO).
☆87 Updated last year
LanqingLi1993 / FOCAL-ICLR
Code for FOCAL Paper Published at ICLR 2021
☆51 Updated last year
lweitkamp / feudalnets-pytorch
PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.
☆41 Updated 5 years ago
jesbu1 / hidio
GitHub repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options
☆46 Updated 3 years ago
rraileanu / idaac
☆54 Updated last year
thu-rllab / CFCQL
Code for the NeurIPS 2023 paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.
☆37 Updated 5 months ago
eric-mitchell / macaw
Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]
☆47 Updated 2 years ago
twni2016 / Memory-RL
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)
☆63 Updated last year
SwapnilPande / MOReL
Model-Based Offline Reinforcement Learning
☆51 Updated 4 years ago
ryanxhr / POR
[NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"
☆57 Updated 2 years ago
ReinholdM / Offline-Pre-trained-Multi-Agent-Decision-Transformer
☆112 Updated 2 years ago
Xingyu-Lin / mbpo_pytorch
A PyTorch replication of the model-based reinforcement learning algorithm MBPO
☆175 Updated 3 years ago
frt03 / generalized_dt
Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR 2022)
☆67 Updated 2 years ago
amazon-science / meta-q-learning
Code for the paper "Meta-Q-Learning" (ICLR 2020)
☆103 Updated 3 years ago
martius-lab / cid-in-rl
Code for the paper: "Causal Influence Detection for Improving Efficiency in Reinforcement Learning", by Seitzer, M., Schölkopf, B., Marti…
☆43 Updated 3 years ago
BY571 / CQL
PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…
☆136 Updated last year
daniellawson9999 / online-decision-transformer
An unofficial implementation for online decision transformer
☆40 Updated 2 years ago
TonghanWang / NDQ
Code accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)
☆81 Updated 2 years ago
lmzintgraf / varibad
Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)
☆192 Updated 2 years ago
lweitkamp / option-critic-pytorch
PyTorch implementation of the Option-Critic framework, Harb et al. 2016
☆130 Updated last year
Howuhh / prioritized_experience_replay
Prioritized Experience Replay implementation with proportional prioritization
☆81 Updated 2 years ago
alirezakazemipour / PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆51 Updated 2 months ago
TonghanWang / DOP
Code accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)
☆51 Updated 2 years ago
lucaslingle / pytorch_rl2
Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'
☆65 Updated 3 years ago
tianheyu927 / mopo
Code for MOPO: Model-based Offline Policy Optimization
☆182 Updated 3 years ago
ruizhaogit / maximum_entropy_population_based_training
Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination
☆28 Updated 2 years ago
Haichao-Zhang / PEX
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR 2023)
☆59 Updated 2 years ago
shlee94 / Off2OnRL
☆56 Updated 2 years ago
DesikRengarajan / LOGO
[ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration
☆28 Updated 3 years ago
sfujim / TD7
Author's PyTorch implementation of TD7 for online and offline RL
☆146 Updated last year