基于强化学习(RL)的冰壶游戏实例; 梯度下降的Sarsa(lambda) + 非均匀径向基特征表示
☆21Jul 5, 2020Updated 5 years ago
Alternatives and similar repositories for Reinforcement_Learning_Curling
Users that are interested in Reinforcement_Learning_Curling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Almost curling game☆12Mar 16, 2019Updated 7 years ago
- Battleship environment for reinforcement learning tasks☆14Apr 29, 2023Updated 2 years ago
- CarND Capstone☆10Apr 2, 2018Updated 8 years ago
- 比较好的网络框架,有时间可以看看 kcp☆10Sep 26, 2021Updated 4 years ago
- Reinforcement learning demo with slides in David Silver RL lectures☆13Jan 7, 2018Updated 8 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆13Jul 23, 2023Updated 2 years ago
- Predictive Triggering Framework for Distributed Control of Resource Constrained Multi-agent Systems☆13May 19, 2019Updated 6 years ago
- This repository contains control algorithms for unmanned surface vehicles☆15Jun 9, 2021Updated 4 years ago
- ☆16Jul 29, 2025Updated 8 months ago
- 中国科学院大学人工智能学院模式识别(刘成林,向世明,张煦尧老师)☆39Jan 9, 2021Updated 5 years ago
- ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World☆25Jun 17, 2025Updated 9 months ago
- SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability☆17May 8, 2025Updated 11 months ago
- [JSAC 2019] Energy-Efficient Distributed Mobile Crowd Sensing: A Deep Learning Approach☆15May 16, 2022Updated 3 years ago
- Simple top-down low-performance Open AI gym wrapper of a ship simulator with training scripts for Rllib and stable-baselines☆20Dec 8, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Efficient Global Optimization☆10Feb 26, 2016Updated 10 years ago
- A keras based implementation of FuSENet as in paper "Fused Squeeze-and-Excitation Network for Spectral-Spatial Hyperspectral Image Classi…☆12Dec 12, 2020Updated 5 years ago
- Inverse Kinematics for MANO hands☆19Feb 23, 2022Updated 4 years ago
- Deep Collaborative Attention Network for Hyperspectral Image Classification by Combining 2-D CNN and 3-D CNN, JSTARS, 2020☆10Aug 31, 2020Updated 5 years ago
- AlphaGo inspired TSP Heuristic Solver