leofansq / Reinforcement_Learning_Curling
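A curling game example based on reinforcement learning (RL); gradient-descent Sarsa(lambda) + non-uniform radial basis function feature representation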
☆18 Updated 4 years ago
Alternatives and similar repositories for Reinforcement_Learning_Curling:
Users that are interested in Reinforcement_Learning_Curling are comparing it to the libraries listed below
- Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization ☆12 Updated 6 months ago
- Adversarial Reinforcement Learning papers (single-agent setting and multi-agent setting) ☆64 Updated 2 years ago
- ☆35 Updated last month
- ☆27 Updated 9 months ago
- rl-papers ☆47 Updated last year
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS… ☆72 Updated 2 years ago
- Attempt to reproduce and improve the implementation of the paper 'Improving Multi-Target Cooperative Tracking Guidance for UAV Swarms Usi… ☆32 Updated 7 months ago
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC ☆19 Updated this week
- Air combat confrontation based on reinforcement learning ☆64 Updated 3 years ago
- A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI. ☆98 Updated last year
- ☆159 Updated last year
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC) ☆57 Updated last year
- (ICML 2024) The official code for EvoRainbow: Combining Improvements in Evolutionary Reinforcement Learning for Policy Search ☆21 Updated 7 months ago
- Solve BipedalWalkerHardcore-v2 with TD3 ☆83 Updated last year
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning" ☆55 Updated last year
- ☆100 Updated last month
- The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Ta… ☆149 Updated 6 months ago
- ☆19 Updated last year
- ☆35 Updated last year
- Multiagent Reinforcement Learning Research Project ☆128 Updated 3 months ago
- [ICML' 24] The PyTorch implementation of our paper: "Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforc… ☆15 Updated 7 months ago
- ☆41 Updated 3 years ago
- A large-scale multi-modal pre-trained model ☆129 Updated last year
- NeurIPS 2024 DACER ☆62 Updated 3 weeks ago
- Play the Atari Tennis game with DQN ☆71 Updated 2 years ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te… ☆162 Updated last year
- A collection of recent MARL papers ☆82 Updated 2 months ago
- Use seaborn to draw RL picture ☆25 Updated last year
- ☆93 Updated 3 years ago
- PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and .... ☆52 Updated 4 years ago