hammer-wang / Awesome-Transformers-for-Sequential-Decision-Making
Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond.
☆49Updated 3 years ago
Alternatives and similar repositories for Awesome-Transformers-for-Sequential-Decision-Making
Users who are interested in Awesome-Transformers-for-Sequential-Decision-Making are comparing it to the repositories listed below
- Code for FOCAL Paper Published at ICLR 2021☆55Updated 2 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆57Updated 2 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆70Updated 3 years ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆53Updated last year
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023☆33Updated last year
- Implementation of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆79Updated last year
- Implementation of Multi-Game Decision Transformers in PyTorch☆49Updated 2 years ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆166Updated 2 years ago
- ☆31Updated 3 years ago
- Official code repository for Prompt-DT.☆120Updated 3 years ago
- A list of papers regarding generalization in (deep) reinforcement learning☆153Updated 2 years ago
- ☆134Updated last year
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)☆79Updated 3 years ago
- Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning☆19Updated 3 weeks ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆28Updated 4 years ago
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO).☆94Updated 2 years ago
- ☆91Updated 3 years ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆42Updated last year
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆20Updated 3 years ago
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]☆46Updated 3 years ago
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”☆26Updated 2 years ago
- A list of Offline to Online RL papers (continually updated)☆62Updated last month
- An unofficial implementation for online decision transformer☆41Updated 3 years ago
- RLA is a tool for managing your RL experiments automatically☆31Updated last year
- Code for the paper: "Causal Influence Detection for Improving Efficiency in Reinforcement Learning", by Seitzer, M., Schölkopf, B., Marti…☆47Updated 3 years ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆61Updated last year
- ☆18Updated 3 years ago
- Official repository of the paper "FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning"☆34Updated last year
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆26Updated 3 years ago
- Official codebase for GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning.☆29Updated last year