利用强化学习的Q价值迭代,Q学习以及SARSA方法解决小车爬山以及倒立摆的控制问题
☆14Jul 25, 2019Updated 6 years ago
Alternatives and similar repositories for ReinforcementLearning
Users that are interested in ReinforcementLearning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 强化学习大作业1 倒立摆☆20Dec 8, 2022Updated 3 years ago
- ☆13Jun 3, 2022Updated 4 years ago
- H_inf tracking control for linear discrete-time systems using ADP☆12Jun 6, 2020Updated 6 years ago
- Master's Thesis Project: Design, Development, Modelling and Simulating of a Y6 Multi-Rotor UAV, Imlementing Control Schemes such as Propo…☆12Mar 23, 2020Updated 6 years ago
- 数据科学与人工智能中文讲义☆14May 13, 2026Updated 3 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official implementation of "Real-time Control of Electric Autonomous Mobility-on-Demand Systems via Graph Reinforcement Learning"☆11Jun 22, 2024Updated last year
- Implemention of lanenet model for real time lane detection using deep neural network model https://maybeshewill-cv.github.io/lanenet-lane…☆15Feb 28, 2019Updated 7 years ago
- CFR-based Texas Hold'em AI☆11Jan 30, 2021Updated 5 years ago
- MATLAB simulation and final report/presentation for M.S. thesis. "Adaptive Dynamic Programming for Human Postural Balance Control"☆18May 24, 2018Updated 8 years ago
- A python API for plane detection in point clouds☆12Apr 22, 2021Updated 5 years ago
- Point cloud capture of Gocator3100 device☆13Dec 20, 2016Updated 9 years ago
- Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021☆13Nov 3, 2021Updated 4 years ago
- This is a MATLAB-based reinforcement learning framework that includes the Proximal Policy Optimization (PPO) algorithm and its multi-agen…☆33Jan 14, 2026Updated 4 months ago
- MetaPlanner is an open source automated treatment planning method that performs meta-optimization of treatment planning hyperparameters. …☆14Nov 7, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆35Sep 5, 2020Updated 5 years ago
- Data-driven attitude control design for multirotor UAVs☆21May 1, 2017Updated 9 years ago
- Thesis: Application of Reinforcement Learning for the Control of Nonlinear Dynamical Systems☆18Apr 16, 2020Updated 6 years ago
- Latent Dynamics Mixture, NeurIPS 2021☆18Oct 25, 2022Updated 3 years ago
- MRI-based Deep Learning Segmentation and Radiomics of Sarcoma Tumors in Mice☆16Mar 24, 2023Updated 3 years ago
- This repo features a deep reinforcement learning Home Energy Management System for cost-effective heating. It optimizes energy consumptio…☆14Dec 19, 2023Updated 2 years ago
- Lorentzian Distance Learning for Hyperbolic Representations: Retrieval experiments☆13May 28, 2019Updated 7 years ago
- ☆15Oct 6, 2019Updated 6 years ago
- This branch contain the java classes for orekit-python-wrapper☆20May 13, 2026Updated 3 weeks ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆18Feb 22, 2022Updated 4 years ago
- Approximate dynamic programming for stochastic optimal control in Pytorch☆24Aug 26, 2023Updated 2 years ago
- ☆22May 20, 2021Updated 5 years ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆20May 27, 2025Updated last year
- Implementation of NeurIPS 2018 paper "Meta-Gradient Reinforcement Learning"☆21Jul 19, 2022Updated 3 years ago
- Official Implementation of DMT: Dual Mean-Teacher in PyTorch.☆10Oct 27, 2023Updated 2 years ago
- Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)☆29Nov 23, 2024Updated last year
- Simple implementation of regret matching algorithm for RPS nash equilibrium computation via self-play☆26Sep 25, 2018Updated 7 years ago
- UCAS course-evaluation; 国科大课程自动评价脚本☆21Jun 9, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Formation control of multiple unicycle robots (MATLAB)☆32Apr 5, 2017Updated 9 years ago
- Matlab code to control underactuated systems based on a hybrid approach that combines neural networks, reinforcement learning, fuzzy logi…☆31Nov 28, 2013Updated 12 years ago
- Iterative Closest Point algorith implementation to register 2 point clouds☆26Jan 26, 2025Updated last year
- MATLAB code for the numerical example in ``Event-Triggered Consensus for Multi-Agent Systems with Guaranteed Robustly Positive Minimum In…☆38Mar 9, 2019Updated 7 years ago
- Ultra low cost 3D printable SCARA arm☆18Jul 30, 2015Updated 10 years ago
- This repository contains the code for GMR-based Gaussian process.☆23Oct 7, 2019Updated 6 years ago
- Needs update☆22Sep 6, 2019Updated 6 years ago