kamildar / gym-match3
env for gym, match3 game
☆11Updated 5 years ago
Related projects: ⓘ
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆47Updated last year
- A platform for intelligent agent learning based on a 3D open-world FPS game developed by Inspir.AI.☆55Updated 2 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆43Updated 2 years ago
- ☆35Updated 2 years ago
- original source code of the ASE 2019 paper: Wuji: Automatic Online Combat Game Testing Using Evolutionary Deep Reinforcement Learning☆25Updated 4 years ago
- ☆41Updated 3 years ago
- ☆55Updated this week
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆44Updated 5 years ago
- ☆79Updated 2 months ago
- A new paper list for multi-agent reinforcement learning (actively maintained)☆25Updated 4 years ago
- Learning-based agent for Google Research Football (足球游戏智能体)☆106Updated last year
- Code for "Joint Policy Search for Collaborative Multi-agent Incomplete Information Games"☆50Updated 10 months ago
- World Models with A3C on Carracing-v0 in gym☆32Updated 4 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆51Updated 3 years ago
- Adaptable Agent Populations via a Generative Model of Policies☆13Updated 2 years ago
- Random Network Distillation(RND) algo in Pytorch☆48Updated 5 years ago
- CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning☆63Updated 4 years ago
- PyTorch RL for Pommerman☆38Updated 5 years ago
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆17Updated 2 years ago
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M…☆76Updated 5 years ago
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021☆14Updated 3 years ago
- ☆135Updated 3 years ago
- Code for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills"☆36Updated 4 years ago
- ☆24Updated 2 years ago
- A PyTorch implementation of SSINet.☆16Updated 3 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆17Updated 2 years ago
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆36Updated 6 years ago
- Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)☆62Updated 3 years ago
- The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning…☆55Updated last year
- Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty☆56Updated 5 years ago