利用强化学习的Q价值迭代,Q学习以及SARSA方法解决小车爬山以及倒立摆的控制问题
☆14Jul 25, 2019Updated 6 years ago
Alternatives and similar repositories for ReinforcementLearning
Users that are interested in ReinforcementLearning are comparing it to the libraries listed below
Sorting:
- POC CVE-2019-0708 with python script!☆14Jun 24, 2019Updated 6 years ago
- 强化学习大作业1 倒立摆☆20Dec 8, 2022Updated 3 years ago
- CVE-2019-0708 Exploit Tool☆18Jul 18, 2019Updated 6 years ago
- ☆11Jan 5, 2018Updated 8 years ago
- ☆13Jun 3, 2022Updated 3 years ago
- H_inf tracking control for linear discrete-time systems using ADP☆12Jun 6, 2020Updated 5 years ago
- Master's Thesis Project: Design, Development, Modelling and Simulating of a Y6 Multi-Rotor UAV, Imlementing Control Schemes such as Propo…☆12Mar 23, 2020Updated 5 years ago
- EVQUARIUM is an evaluation tool that quantifies the accessibility of EV charging station locations using queueing and graph theory. Given…☆22Mar 1, 2026Updated 2 weeks ago
- 数据科学与人工智能中文讲义☆14Updated this week
- Official implementation of "Real-time Control of Electric Autonomous Mobility-on-Demand Systems via Graph Reinforcement Learning"☆10Jun 22, 2024Updated last year
- Implemention of lanenet model for real time lane detection using deep neural network model https://maybeshewill-cv.github.io/lanenet-lane…☆15Feb 28, 2019Updated 7 years ago
- ☆12Aug 20, 2021Updated 4 years ago
- ☆11Feb 16, 2025Updated last year
- CFR-based Texas Hold'em AI☆11Jan 30, 2021Updated 5 years ago
- MATLAB simulation and final report/presentation for M.S. thesis. "Adaptive Dynamic Programming for Human Postural Balance Control"☆18May 24, 2018Updated 7 years ago
- Official Repo of "CIBench: Evaluation of LLMs as Code Interpreter "☆14Jul 19, 2024Updated last year
- Probabilistic plane extraction☆23May 16, 2019Updated 6 years ago
- A python API for plane detection in point clouds☆12Apr 22, 2021Updated 4 years ago
- Point cloud capture of Gocator3100 device☆13Dec 20, 2016Updated 9 years ago
- model free adaptive iterative learning control☆14Nov 17, 2021Updated 4 years ago
- 基于redis和mysql的数据持久化方案☆16Dec 10, 2013Updated 12 years ago
- A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval☆43Apr 13, 2022Updated 3 years ago
- Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021☆13Nov 3, 2021Updated 4 years ago
- Automatically generates Hydra jobset reports using `nix-review-tools`, updated hourly.☆16Updated this week
- zabbix alert script; 支持简单的压缩合并☆10Apr 18, 2019Updated 6 years ago
- This is a MATLAB-based reinforcement learning framework that includes the Proximal Policy Optimization (PPO) algorithm and its multi-agen…☆33Jan 14, 2026Updated 2 months ago
- MetaPlanner is an open source automated treatment planning method that performs meta-optimization of treatment planning hyperparameters. …☆13Nov 7, 2023Updated 2 years ago
- ☆35Sep 5, 2020Updated 5 years ago
- p2p git repo primitive☆14Jun 10, 2018Updated 7 years ago
- Data-driven attitude control design for multirotor UAVs☆20May 1, 2017Updated 8 years ago
- ☆14Aug 25, 2022Updated 3 years ago
- Companion source code for the 2nd edition of Make Your Own Programming Language☆11Nov 1, 2018Updated 7 years ago
- A repository hosting some of my own vulnerability reports and proof-of-concepts.☆15Aug 8, 2019Updated 6 years ago
- facebook/immutable-js wrapper providing static functions to work with functional programming☆12Aug 6, 2014Updated 11 years ago
- 简单的电力系统计算工具包, 用于求解普通潮流, 交直流潮流, OPF, 故障, 暂态稳定.☆30May 12, 2021Updated 4 years ago
- (Pattern Recognition 2025) Towards Trustworthy Dataset Distillation☆14Dec 8, 2024Updated last year
- quadruped simulation using unitree a1 in pybullet, controller code from stanford pupper☆14May 19, 2021Updated 4 years ago
- Thesis: Application of Reinforcement Learning for the Control of Nonlinear Dynamical Systems☆18Apr 16, 2020Updated 5 years ago
- ☆14Apr 25, 2025Updated 10 months ago