luckeciano / transformers-metarl
Transformers are Meta-Reinforcement Learners - International Conference on Machine Learning (ICML) 2022
☆65Updated 2 years ago
Alternatives and similar repositories for transformers-metarl
Users who are interested in transformers-metarl are comparing it to the libraries listed below
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆68Updated last year
- ☆54Updated last year
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆41Updated 5 years ago
- Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆46Updated 4 years ago
- Code for FOCAL Paper Published at ICLR 2021☆53Updated 2 years ago
- Code for the paper "Meta-Q-Learning" (ICLR 2020)☆106Updated 3 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO☆182Updated 3 years ago
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]☆46Updated 3 years ago
- Model-Based Offline Reinforcement Learning☆51Updated 4 years ago
- ☆115Updated 2 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆52Updated 7 months ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆144Updated last year
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆62Updated 2 years ago
- Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'☆71Updated 3 years ago
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)☆198Updated 2 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆190Updated 3 years ago
- Code for the paper: "Causal Influence Detection for Improving Efficiency in Reinforcement Learning", by Seitzer, M., Schölkopf, B., Marti…☆47Updated 3 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆48Updated 5 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆176Updated last year
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆140Updated last year
- Prioritized Experience Replay implementation with proportional prioritization☆85Updated 2 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆69Updated 3 years ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆155Updated 4 years ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆40Updated 10 months ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆26Updated 3 years ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration☆28Updated 3 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆157Updated 2 years ago
- Representation Learning for RL☆129Updated 2 years ago
- ☆58Updated 2 years ago
- ☆15Updated 4 years ago