luckeciano / transformers-metarl
Transformers are Meta-Reinforcement Learners - International Conference on Machine Learning (ICML) 2022
☆60Updated last year
Alternatives and similar repositories for transformers-metarl:
Users who are interested in transformers-metarl are comparing it to the libraries listed below:
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆51Updated last year
- Code for FOCAL Paper Published at ICLR 2021☆51Updated last year
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆62Updated last year
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]☆47Updated 2 years ago
- Model-Based Offline Reinforcement Learning☆50Updated 4 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆101Updated 2 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆135Updated 10 months ago
- Official code repository for Prompt-DT.☆107Updated 2 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆39Updated 4 years ago
- Code for the paper "Meta-Q-Learning" (ICLR 2020)☆103Updated 2 years ago
- GitHub repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆45Updated 3 years ago
- A simple PyTorch implementation of discrete soft Q-learning (SQL) and soft Q imitation learning (SQIL)☆56Updated 2 years ago
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas.☆92Updated last week
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆20Updated 2 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆58Updated last year
- Synthetic Experience Replay☆89Updated 10 months ago
- CORRO code☆35Updated 2 years ago
- Prioritized Experience Replay implementation with proportional prioritization☆75Updated last year
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)☆75Updated 2 years ago
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO).☆82Updated last year
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO☆166Updated 2 years ago
- Official implementation of Neural Episodic Control with State Abstraction☆13Updated last year
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆123Updated 7 months ago
- Code for MOPO: Model-based Offline Policy Optimization☆177Updated 2 years ago
- Implementation of Trajectory Transformer with attention caching and batched beam search☆110Updated last year
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆84Updated last year