基于强化学习(RL)的冰壶游戏实例; 梯度下降的Sarsa(lambda) + 非均匀径向基特征表示
☆21Jul 5, 2020Updated 5 years ago
Alternatives and similar repositories for Reinforcement_Learning_Curling
Users that are interested in Reinforcement_Learning_Curling are comparing it to the libraries listed below
Sorting:
- ☆10Mar 1, 2024Updated 2 years ago
- A teleoperation package to control UR robot with VR controller☆15Dec 10, 2024Updated last year
- 中国科学院大学人工智能学院模式识别(刘成林,向世明,张煦尧老师)☆39Jan 9, 2021Updated 5 years ago
- [TGRS 2024 ESI Highly Cited Paper (TOP 1%)] Sliding Dual-Window-Inspired Reconstruction Network for Hyperspectral Anomaly Detection☆13Feb 28, 2024Updated 2 years ago
- Repository for lecture "Data-Driven Demand Learning and Dynamic Pricing Strategies in Competitive Markets"☆12May 8, 2018Updated 7 years ago
- 整合uav模型与gazebo环境,开放测试demo,可使用键盘控制无人机进行遥控飞行与飞行状态数据检测等,是本项目的初步仿真环境成果展示。☆11Nov 22, 2020Updated 5 years ago
- Deep Collaborative Attention Network for Hyperspectral Image Classification by Combining 2-D CNN and 3-D CNN, JSTARS, 2020☆10Aug 31, 2020Updated 5 years ago
- Implement reinforcement learning algorithms to realize highway decision making of autonomous vehicles☆12Apr 27, 2023Updated 2 years ago
- ☆14Mar 5, 2024Updated 2 years ago
- ☆16Jul 29, 2025Updated 7 months ago
- Efficient Global Optimization☆10Feb 26, 2016Updated 10 years ago
- Tabu Search heuristic for Travelling Salesperson Problems with Profits☆11Oct 16, 2018Updated 7 years ago
- 实例分割标注文件格式转换脚本工具集☆10May 5, 2023Updated 2 years ago
- simple demos including: ur5 control in pybullet, virtual camera perception from end effector☆12Apr 12, 2021Updated 4 years ago
- A Python implementation of the SARSA λ reinforcement learning algorithm☆12Mar 6, 2019Updated 7 years ago
- Deep neural network from Newton vs the Machine☆16Oct 23, 2019Updated 6 years ago
- ☆14May 4, 2024Updated last year
- 水面无人艇(USVs)的协同作战问题☆16Jul 14, 2021Updated 4 years ago
- AlphaGo inspired TSP Heuristic Solver☆15Feb 5, 2020Updated 6 years ago
- Hyperspectral Guided Image Dehazing GAN☆10Sep 4, 2020Updated 5 years ago
- ☆13Nov 29, 2024Updated last year
- LaTeX Proposal Template for the University of Chinese Academy of Sciences☆18Oct 14, 2023Updated 2 years ago
- ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World☆24Jun 17, 2025Updated 8 months ago
- 学习资料整理☆12Nov 28, 2012Updated 13 years ago
- The official code of "CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval"☆14Sep 19, 2024Updated last year
- Direct preference optimization with f-divergences.☆16Nov 3, 2024Updated last year
- dataloader for mocap dataset☆29Oct 21, 2025Updated 4 months ago
- Open source volume electron microscopy (vEM) datasets for connectomics.☆15Nov 13, 2024Updated last year
- Perception for human robot handover☆12Dec 30, 2020Updated 5 years ago
- Predictive Triggering Framework for Distributed Control of Resource Constrained Multi-agent Systems☆13May 19, 2019Updated 6 years ago
- GA for orienteering problem☆12Mar 16, 2016Updated 9 years ago
- TeraSim open-source academic version with co-simulation☆22Oct 21, 2025Updated 4 months ago
- TSPPD Test Instance Library☆15Mar 4, 2019Updated 7 years ago
- A Self Driving Remote Control Car☆17Jun 8, 2019Updated 6 years ago
- Classifying Deepfakes Using One-Class Variational Autoencoder☆12Apr 12, 2023Updated 2 years ago
- Hyperspectral Anomaly Detection☆15Apr 3, 2021Updated 4 years ago
- 6D Object Pose Estimation using RGBD Data and Fast-ICP☆18Mar 2, 2018Updated 8 years ago
- [TGRS 2024] MSNet: Self-Supervised Multiscale Network with Enhanced Separation Training for Hyperspectral Anomaly Detection.☆19Jul 26, 2024Updated last year
- Code for the paper "3D FlowMatch Actor: Unified 3D Policy for Single- and Dual-Arm Manipulation"☆32Aug 18, 2025Updated 6 months ago