leofansq / Reinforcement_Learning_Curling
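A curling game example based on reinforcement learning (RL): gradient-descent Sarsa(lambda) with a non-uniform radial basis function feature representation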
☆20 Updated 4 years ago
Alternatives and similar repositories for Reinforcement_Learning_Curling:
Users who are interested in Reinforcement_Learning_Curling are comparing it to the repositories listed below:
- A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI. ☆99 Updated last year
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning" ☆55 Updated last year
- Adversarial Reinforcement Learning papers (single-agent setting and multi-agent setting) ☆70 Updated 2 years ago
- The implementation of AAAI'22 paper "Multi-Agent Incentive Communication via Decentralized Teammate Modeling". ☆52 Updated last year
- ☆71 Updated last year
- ICLR'2024: Learning Multi-Agent Communication from Graph Modeling Perspective ☆28 Updated last year
- rl-papers ☆47 Updated 2 years ago
- demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention… ☆55 Updated 3 years ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS… ☆73 Updated 2 years ago
- Solve BipedalWalkerHardcore-v2 with TD3 ☆87 Updated last year
- ☆38 Updated this week
- A large-scale multi-modal pre-trained model ☆131 Updated 2 years ago
- Exercise Solutions for Reinforcement Learning: An Introduction [2nd Edition] ☆16 Updated 4 years ago
- ppo+action mask for atari tennis agent ☆11 Updated 2 years ago
- Reproduction of the multi-agent reinforcement learning algorithms VDN, QMIX, QTRAN, and QPLEX ☆32 Updated last year
- ☆62 Updated last year
- ☆30 Updated 2 years ago
- ☆43 Updated 2 years ago
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment ☆117 Updated 2 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC) ☆60 Updated last year
- PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and .... ☆52 Updated 5 years ago
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC ☆25 Updated 2 months ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te… ☆166 Updated last year
- Some MARL algorithms implemented in PyTorch ☆66 Updated 3 years ago
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi… ☆115 Updated 2 months ago
- Multiagent Reinforcement Learning Research Project ☆193 Updated 5 months ago
- Half Field Offense in Robocup 2D Soccer with reinforcement learning ☆34 Updated 3 years ago
- ☆29 Updated last year
- ☆163 Updated last year
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh… ☆153 Updated last year