hammer-wang / Awesome-Transformers-for-Sequential-Decision-Making
Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond.
☆47Updated 2 years ago
Alternatives and similar repositories for Awesome-Transformers-for-Sequential-Decision-Making
Users who are interested in Awesome-Transformers-for-Sequential-Decision-Making compare it to the repositories listed below
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆68Updated 3 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆57Updated 2 years ago
- Code for FOCAL Paper Published at ICLR 2021☆52Updated last year
- Implementation of Multi-Game Decision Transformers in PyTorch☆47Updated 2 years ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆53Updated last year
- Official code repository for Prompt-DT.☆117Updated 3 years ago
- ☆18Updated last year
- Implementation of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆77Updated last year
- An unofficial implementation for online decision transformer☆40Updated 3 years ago
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”☆25Updated 2 years ago
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆15Updated last year
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆164Updated 2 years ago
- A list of papers regarding generalization in (deep) reinforcement learning☆153Updated 2 years ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆42Updated last year
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆20Updated 2 years ago
- Implementation of the Off Belief Learning algorithm.☆49Updated 3 years ago
- Code for Adapting Environment Sudden Changes by Learning Context Sensitive Policy☆20Updated 3 years ago
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023☆33Updated 11 months ago
- ☆31Updated 2 years ago
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]☆46Updated 2 years ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆60Updated last year
- Mirror Descent Policy Optimization☆41Updated 5 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆68Updated last year
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆28Updated 4 years ago
- Code for the paper: "Causal Influence Detection for Improving Efficiency in Reinforcement Learning", by Seitzer, M., Schölkopf, B., Marti…☆47Updated 3 years ago
- Author's PyTorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆92Updated last year
- ☆131Updated last year
- Paper Collection for Batch RL with brief introductions.☆84Updated 3 years ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆40Updated 9 months ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Updated 3 years ago