tencent-ailab / hokoff
☆23 Updated 7 months ago
Related projects
Alternatives and complementary repositories for hokoff
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023); football game agent ☆49 Updated last year
- Mini HoK: a novel MARL benchmark based on the popular mobile game, Honor of Kings, to address limitations in existing environments such a… ☆32 Updated 2 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024… ☆30 Updated this week
- Implementation of Multi-Game Decision Transformers in PyTorch ☆43 Updated last year
- ☆53 Updated last week
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning. ☆27 Updated 5 months ago
- [ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model" ☆71 Updated 2 months ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC) ☆54 Updated last year
- Code accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS… ☆71 Updated 2 years ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning" ☆54 Updated 10 months ago
- ☆28 Updated last year
- [NeurIPS 2023] Implementation of Elastic Decision Transformer ☆29 Updated last year
- ☆106 Updated last year
- The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data". ☆38 Updated 3 weeks ago
- Collection of RL Environments built using Madrona ☆25 Updated last year
- Synthetic Experience Replay ☆74 Updated 5 months ago
- Distributed RL implementation using PyTorch and Ray (ApeX (Ape-X), A3C, Distributed-PPO (DPPO), Impala) ☆26 Updated 2 years ago
- ☆25 Updated 7 months ago
- Official code for the paper, "Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining" ☆13 Updated this week
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human … ☆31 Updated 7 months ago
- ☆55 Updated last month
- rl-papers ☆44 Updated last year
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations. ☆90 Updated last year
- Challenging Memory-based Deep Reinforcement Learning Agents ☆86 Updated 3 weeks ago
- Author's PyTorch implementation of the ICLR 2023 paper "Behavior Proximal Policy Optimization" (BPPO). ☆73 Updated 11 months ago
- ☆77 Updated last year
- Official implementation of "Learning from Visual Observation via Offline Pretrained State-to-Go Transformer" ☆19 Updated last year
- Official code for ICML 2024 paper Reinformer: Max-Return Sequence Modeling for offline RL ☆33 Updated last month
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23) ☆48 Updated last year
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh… ☆129 Updated 10 months ago