hammer-wang / Awesome-Transformers-for-Sequential-Decision-MakingLinks
Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond.
☆47Updated 2 years ago
Alternatives and similar repositories for Awesome-Transformers-for-Sequential-Decision-Making
Users that are interested in Awesome-Transformers-for-Sequential-Decision-Making are comparing it to the libraries listed below
Sorting:
- Code for FOCAL Paper Published at ICLR 2021☆51Updated last year
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆67Updated 3 years ago
- Implementation of Multi-Game Decision Transformers in PyTorch☆47Updated 2 years ago
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]☆47Updated 2 years ago
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023☆32Updated 9 months ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆28Updated 3 years ago
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”☆25Updated 2 years ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆37Updated 7 months ago
- A list of papers regarding generalization in (deep) reinforcement learning☆152Updated 2 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆57Updated 2 years ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆53Updated last year
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆71Updated last year
- Paper Collection for Batch RL with brief introductions.☆84Updated 3 years ago
- ☆31Updated 2 years ago
- An unofficial implementation for online decision transformer☆40Updated 2 years ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆163Updated last year
- Implementation of the Off Belief Learning algorithm.☆49Updated 3 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆65Updated last year
- ☆133Updated last year
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆42Updated last year
- RLA is a tool for managing your RL experiments automatically☆29Updated 8 months ago
- [AAAI 2023 Oral] Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition☆36Updated last year
- Official code repository for Prompt-DT.☆115Updated 3 years ago
- Transformers are Meta-Reinforcement Learners - International Conference on Machine Learning (ICML) 2022☆63Updated 2 years ago
- ☆113Updated 2 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆51Updated 2 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆88Updated last year
- ☆89Updated 2 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Updated 2 years ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)☆76Updated 3 years ago