luckeciano / transformers-metarl
Transformers are Meta-Reinforcement Learners - International Conference on Machine Learning (ICML) 2022
☆63 · Updated 2 years ago
Alternatives and similar repositories for transformers-metarl
Users that are interested in transformers-metarl are comparing it to the libraries listed below
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO). ☆87 · Updated last year
- Code for the FOCAL paper published at ICLR 2021 ☆51 · Updated last year
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017. ☆41 · Updated 5 years ago
- GitHub repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options ☆46 · Updated 3 years ago
- Code for the NeurIPS 2023 paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning. ☆37 · Updated 5 months ago
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021] ☆47 · Updated 2 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral) ☆63 · Updated last year
- Model-Based Offline Reinforcement Learning ☆51 · Updated 4 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning" ☆57 · Updated 2 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆175 · Updated 3 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR 2022) ☆67 · Updated 2 years ago
- Code for the paper "Meta-Q-Learning" (ICLR 2020) ☆103 · Updated 3 years ago
- Code for the paper: "Causal Influence Detection for Improving Efficiency in Reinforcement Learning", by Seitzer, M., Schölkopf, B., Marti… ☆43 · Updated 3 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co… ☆136 · Updated last year
- An unofficial implementation of Online Decision Transformer ☆40 · Updated 2 years ago
- Code accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020) ☆81 · Updated 2 years ago
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020) ☆192 · Updated 2 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016 ☆130 · Updated last year
- Prioritized Experience Replay implementation with proportional prioritization ☆81 · Updated 2 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆51 · Updated 2 months ago
- Code accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322) ☆51 · Updated 2 years ago
- Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning' ☆65 · Updated 3 years ago
- Code for MOPO: Model-based Offline Policy Optimization ☆182 · Updated 3 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination ☆28 · Updated 2 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR 2023) ☆59 · Updated 2 years ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration ☆28 · Updated 3 years ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆146 · Updated last year