tencent-ailab / hokoff
☆20Updated 4 months ago
Related projects:
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023). Football game agent.☆47Updated last year
- Mini HoK: a novel MARL benchmark based on the popular mobile game, Honor of Kings, to address limitations in existing environments such a…☆29Updated 3 weeks ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆30Updated 5 months ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆24Updated 3 months ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆54Updated 8 months ago
- MARS is short for Multi-Agent Research Studio, a library for multi-agent reinforcement learning research.☆43Updated 6 months ago
- Implementation of Multi-Game Decision Transformers in PyTorch☆42Updated last year
- Challenging Memory-based Deep Reinforcement Learning Agents☆76Updated 3 months ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆48Updated 3 years ago
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.☆85Updated last year
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆48Updated last year
- A platform for intelligent agent learning based on a 3D open-world FPS game developed by Inspir.AI.☆55Updated 2 years ago
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games☆21Updated last year
- This code accompanies the paper "Scalable Multi-Agent Model-Based Reinforcement Learning".☆46Updated last year
- ☆103Updated last year
- Related papers for offline reinforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆17Updated 2 years ago
- Synthetic Experience Replay☆62Updated 3 months ago
- ☆24Updated last week
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization☆77Updated last year
- Benchmarks for Multi-Objective Multi-Agent Decision Making☆51Updated last month
- ☆48Updated 6 months ago
- [NeurIPS 2023] Implementation of Elastic Decision Transformer☆28Updated 11 months ago
- ☆32Updated 5 months ago
- ☆13Updated 3 weeks ago
- A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.☆92Updated 8 months ago
- Unofficial code for online decision transformer☆37Updated last year
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)☆49Updated 4 months ago
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆32Updated 5 months ago
- Baselines for Neural MMO -- new users should treat this repo as a starter project☆47Updated last month
- Transformer-based World Models☆66Updated last year