leofansq / Reinforcement_Learning_Curling
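A curling game example based on reinforcement learning (RL); gradient-descent Sarsa(lambda) + non-uniform radial basis function feature representation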
☆21Updated 5 years ago
Alternatives and similar repositories for Reinforcement_Learning_Curling
Users that are interested in Reinforcement_Learning_Curling are comparing it to the libraries listed below
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆143Updated 7 months ago
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆39Updated 4 months ago
- Play the Atari Tennis game with DQN☆78Updated 3 years ago
- Official implementation of the NeurIPS 2024 paper CORY☆24Updated 9 months ago
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models☆50Updated 8 months ago
- TextStarCraft2, a pure-language environment that supports LLMs playing StarCraft II☆293Updated 7 months ago
- A curated list of visual reinforcement learning resources☆447Updated 3 weeks ago
- Multiagent Reinforcement Learning Research Project☆220Updated 5 months ago
- A large-scale multi-modal pre-trained model☆132Updated 2 years ago
- Exercise Solutions for Reinforcement Learning: An Introduction [2nd Edition]☆16Updated 5 years ago
- ☆49Updated 7 months ago
- ☆68Updated last year
- A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.☆113Updated last year
- ☆172Updated 2 years ago
- Unified Reinforcement Learning Framework☆797Updated last year
- [NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks☆194Updated last year
- RL algorithms☆141Updated 4 years ago
- Official code for ICML 2024 paper, "RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences" (ICML 2024 Spotlight)☆34Updated last year
- Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)☆38Updated 2 years ago
- ☆55Updated 10 months ago
- The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Ta…☆156Updated last year
- [AAAI 2025 Oral] Official code for "RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors"☆32Updated 10 months ago
- A collection of ICML papers and open-source projects from past years, covering ICML 2021, ICML 2022, ICML 2023, ICML 2024, and ICML 2025.☆37Updated 9 months ago
- Tianshou Chinese documentation☆61Updated last year
- source code for AAMAS 2023 Imperfect-information Card Game Competition☆13Updated last year
- ☆33Updated 2 years ago
- Implementation of TWOSOME☆82Updated 11 months ago
- basic algorithms of reinforcement learning☆215Updated 2 years ago
- ☆88Updated 2 years ago
- rl-papers☆48Updated 2 years ago