利用强化学习的Q价值迭代,Q学习以及SARSA方法解决小车爬山以及倒立摆的控制问题
☆14Jul 25, 2019Updated 6 years ago
Alternatives and similar repositories for ReinforcementLearning
Users that are interested in ReinforcementLearning are comparing it to the libraries listed below
Sorting:
- 强化学习大作业1 倒立摆☆20Dec 8, 2022Updated 3 years ago
- CFR-based Texas Hold'em AI☆11Jan 30, 2021Updated 5 years ago
- Source code of the Neuro-dynamic programming approach for optimal control of Macroscopic fundamental diagram (MFD) system)☆10Aug 4, 2020Updated 5 years ago
- Master's Thesis Project: Design, Development, Modelling and Simulating of a Y6 Multi-Rotor UAV, Imlementing Control Schemes such as Propo…☆11Mar 23, 2020Updated 5 years ago
- video captioning using 3DCNN and LSTM (pytorch)☆11Sep 26, 2019Updated 6 years ago
- ☆14Apr 25, 2025Updated 10 months ago
- ☆13Jun 3, 2022Updated 3 years ago
- 数据科学与人工智能中文讲义☆12Updated this week
- H_inf tracking control for linear discrete-time systems using ADP☆12Jun 6, 2020Updated 5 years ago
- Point cloud capture of Gocator3100 device☆13Dec 20, 2016Updated 9 years ago
- Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021☆13Nov 3, 2021Updated 4 years ago
- model free adaptive iterative learning control☆14Nov 17, 2021Updated 4 years ago
- Implemention of lanenet model for real time lane detection using deep neural network model https://maybeshewill-cv.github.io/lanenet-lane…☆15Feb 28, 2019Updated 7 years ago
- MATLAB simulation and final report/presentation for M.S. thesis. "Adaptive Dynamic Programming for Human Postural Balance Control"☆18May 24, 2018Updated 7 years ago
- 中国科学院大学刘成林老师模式识别☆12Jan 7, 2021Updated 5 years ago
- ☆15Oct 27, 2023Updated 2 years ago
- Latent Dynamics Mixture, NeurIPS 2021☆18Oct 25, 2022Updated 3 years ago
- This is a official code implementation for Nonlinear RISE based Integral Reinforcement Learning algorithms for perturbed Bilateral Teleop…☆24Mar 26, 2025Updated 11 months ago
- Thesis: Application of Reinforcement Learning for the Control of Nonlinear Dynamical Systems☆18Apr 16, 2020Updated 5 years ago
- Data-driven attitude control design for multirotor UAVs☆20May 1, 2017Updated 8 years ago
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆24Feb 27, 2022Updated 4 years ago
- ☆22May 20, 2021Updated 4 years ago
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"?☆38Jun 23, 2025Updated 8 months ago
- Code for "AutoCFR: Learning to Design Counterfatual Regret Minimization Algorithms", AAAI 2022 (Oral)☆22Apr 22, 2024Updated last year
- High Order Model Free Adaptive Iterative Learning Control matlab code☆21Nov 17, 2021Updated 4 years ago
- [WACV'23] Mixture Outlier Exposure for Out-of-Distribution Detection in Fine-grained Environments☆27Apr 12, 2023Updated 2 years ago
- awesome confidence calibration paper list☆25Oct 21, 2021Updated 4 years ago
- Optimized settling time in formation control of UAVs swarm navigation in the presence of obstacles☆30Dec 20, 2019Updated 6 years ago
- Integrating Reinforcement Learning and Model Predictive Control for Enhancing Safety in Automated Vehicle Systems☆35Jan 2, 2024Updated 2 years ago
- (TPAMI 2025) ProtoGCD: Unified and Unbiased Prototype Learning for Generalized Category Discovery☆34Jun 13, 2025Updated 8 months ago
- MATLAB based implementation of ofrmation control of multi-agent system☆28Mar 18, 2018Updated 7 years ago
- [CVPR2025] BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding☆38Feb 5, 2026Updated 3 weeks ago
- notes☆32Jun 28, 2022Updated 3 years ago
- MATLAB code for the numerical example in ``Event-Triggered Consensus for Multi-Agent Systems with Guaranteed Robustly Positive Minimum In…☆37Mar 9, 2019Updated 6 years ago
- Official implementation for the paper "Offline Meta RL - Identifiability Challenges and Effective Data Collection Strategies", NeurIPS 20…☆31Nov 23, 2021Updated 4 years ago
- Implementation of a paper on adaptive optimal control (solving ARE) based on policy iterations☆30Aug 25, 2019Updated 6 years ago
- This is a multi agent reinforcement learning system using SUMO for large scale traffic light control☆30May 20, 2020Updated 5 years ago
- Matlab code to control underactuated systems based on a hybrid approach that combines neural networks, reinforcement learning, fuzzy logi…☆30Nov 28, 2013Updated 12 years ago
- ☆35Sep 5, 2020Updated 5 years ago