Snow-Dancing/ReinforcementLearning

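Solves the mountain car and inverted pendulum control problems using the reinforcement learning methods Q-value iteration, Q-learning, and SARSA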

☆14

Alternatives and similar repositories for ReinforcementLearning

Users who are interested in ReinforcementLearning are comparing it to the repositories listed below.

dalek-who / Inverted-Pendulum
Reinforcement learning major assignment 1: inverted pendulum
☆20 · Dec 8, 2022 · Updated 3 years ago
Cothrax / deepfool
CFR-based Texas Hold'em AI
☆11 · Jan 30, 2021 · Updated 5 years ago
StevenSuzch / NeuroDynamicProgramming-MFD
Source code of the neuro-dynamic programming approach for optimal control of a macroscopic fundamental diagram (MFD) system
☆10 · Aug 4, 2020 · Updated 5 years ago
wuyou33 / Dynamic-Modelling-Simulation-and-Control-of-Asymmetrical-VTOL-Multi-Rotor-UAVs
Master's Thesis Project: Design, Development, Modelling and Simulating of a Y6 Multi-Rotor UAV, Implementing Control Schemes such as Propo…
☆11 · Mar 23, 2020 · Updated 5 years ago
yiskw713 / VideoCaptioning
video captioning using 3DCNN and LSTM (pytorch)
☆11 · Sep 26, 2019 · Updated 6 years ago
NVlabs / FRAG
☆14 · Apr 25, 2025 · Updated 10 months ago
anair13 / bullet-manipulation-affordances
☆13 · Jun 3, 2022 · Updated 3 years ago
OpenDocCN / dsai-notes-zh
Chinese lecture notes on data science and artificial intelligence
☆12 · Updated this week
yunzhongjie / DT-Hinf-Tracking
H_inf tracking control for linear discrete-time systems using ADP
☆12 · Jun 6, 2020 · Updated 5 years ago
beta-robots / gocator_3100
Point cloud capture of Gocator3100 device
☆13 · Dec 20, 2016 · Updated 9 years ago
robintyh1 / neurips2021-meta-gradient-offpolicy-evaluation
Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021
☆13 · Nov 3, 2021 · Updated 4 years ago
happy2h / MFAILC
Model-free adaptive iterative learning control
☆14 · Nov 17, 2021 · Updated 4 years ago
andylei77 / lanenet-lane-detection
Implementation of the lanenet model for real-time lane detection using a deep neural network model https://maybeshewill-cv.github.io/lanenet-lane…
☆15 · Feb 28, 2019 · Updated 7 years ago
ericmauro / MS-Thesis-ADP-for-Human-Balance
MATLAB simulation and final report/presentation for M.S. thesis. "Adaptive Dynamic Programming for Human Postural Balance Control"
☆18 · May 24, 2018 · Updated 7 years ago
LijunRio / Pattern_Classification
Pattern recognition course taught by Liu Chenglin at the University of Chinese Academy of Sciences
☆12 · Jan 7, 2021 · Updated 5 years ago
DTennant / Incremental-Generalized-Category-Discovery
☆15 · Oct 27, 2023 · Updated 2 years ago
suyoung-lee / LDM
Latent Dynamics Mixture, NeurIPS 2021
☆18 · Oct 25, 2022 · Updated 3 years ago
ZQuang2202 / NonlinearRISE-IRL-BTs
This is an official code implementation for Nonlinear RISE based Integral Reinforcement Learning algorithms for perturbed Bilateral Teleop…
☆24 · Mar 26, 2025 · Updated 11 months ago
mpritzkoleit / pygent
Thesis: Application of Reinforcement Learning for the Control of Nonlinear Dynamical Systems
☆18 · Apr 16, 2020 · Updated 5 years ago
comkieffer / Thesis
Data-driven attitude control design for multirotor UAVs
☆20 · May 1, 2017 · Updated 8 years ago
sjtu-marl / bd_rd_psro
Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games
☆24 · Feb 27, 2022 · Updated 4 years ago
diversepsro / diverse_psro
☆22 · May 20, 2021 · Updated 4 years ago
GaryStack / MMR-V
Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"?
☆38 · Jun 23, 2025 · Updated 8 months ago
rpSebastian / AutoCFR
Code for "AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms", AAAI 2022 (Oral)
☆22 · Apr 22, 2024 · Updated last year
happy2h / HOMFAILC
High Order Model Free Adaptive Iterative Learning Control matlab code
☆21 · Nov 17, 2021 · Updated 4 years ago
zjysteven / MixOE
[WACV'23] Mixture Outlier Exposure for Out-of-Distribution Detection in Fine-grained Environments
☆27 · Apr 12, 2023 · Updated 2 years ago
tor4z / awesome-confidence-calibration
awesome confidence calibration paper list
☆25 · Oct 21, 2021 · Updated 4 years ago
Rajshah05 / Behaviour-Optimization-for-Swarm-Navigation-in-Cluttered-Environment
Optimized settling time in formation control of UAVs swarm navigation in the presence of obstacles
☆30 · Dec 20, 2019 · Updated 6 years ago
ZITingHUANG1 / DRL-MPC
Integrating Reinforcement Learning and Model Predictive Control for Enhancing Safety in Automated Vehicle Systems
☆35 · Jan 2, 2024 · Updated 2 years ago
mashijie1028 / ProtoGCD
(TPAMI 2025) ProtoGCD: Unified and Unbiased Prototype Learning for Generalized Category Discovery
☆34 · Jun 13, 2025 · Updated 8 months ago
AqibNasimM / Formation-Cotnrol-of-multi-agent-system
MATLAB-based implementation of formation control of a multi-agent system
☆28 · Mar 18, 2018 · Updated 7 years ago
sming256 / BOLT
[CVPR2025] BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding
☆38 · Feb 5, 2026 · Updated 3 weeks ago
thesouther / myNotes
notes
☆32 · Jun 28, 2022 · Updated 3 years ago
victordolk / eventtriggered-consensus
MATLAB code for the numerical example in ``Event-Triggered Consensus for Multi-Agent Systems with Guaranteed Robustly Positive Minimum In…
☆37 · Mar 9, 2019 · Updated 6 years ago
Rondorf / BOReL
Official implementation for the paper "Offline Meta RL - Identifiability Challenges and Effective Data Collection Strategies", NeurIPS 20…
☆31 · Nov 23, 2021 · Updated 4 years ago
mithun-bharadwaj / Adaptive_Optimal_Control
Implementation of a paper on adaptive optimal control (solving ARE) based on policy iterations
☆30 · Aug 25, 2019 · Updated 6 years ago
LMozart / sumo-multiagent
A multi-agent reinforcement learning system using SUMO for large-scale traffic light control
☆30 · May 20, 2020 · Updated 5 years ago
razo-zapata / fuzzy-RL-wavelet-networks
Matlab code to control underactuated systems based on a hybrid approach that combines neural networks, reinforcement learning, fuzzy logi…
☆30 · Nov 28, 2013 · Updated 12 years ago
DRL-CASIA / GameAI-FightingAI
☆35 · Sep 5, 2020 · Updated 5 years ago