zouchangjie/RL-Nash-Q-learning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zouchangjie/RL-Nash-Q-learning)

zouchangjie / RL-Nash-Q-learning

强化学习中纳什Qlearning 实现矩阵博弈

☆30

Alternatives and similar repositories for RL-Nash-Q-learning

Users that are interested in RL-Nash-Q-learning are comparing it to the libraries listed below

Sorting:

jtonglet / Nash-Q-Learning
View on GitHub
Implementation of the Nash Q-Learning algorithm to solve simple MARL problems with two agents.
☆26Jan 3, 2023Updated 3 years ago
alireza-montazeri / AV-Nash-Q-Learning
View on GitHub
Implementation of Nash Q-Learning for Autonomous Vehicle Decision Making
☆16Sep 16, 2022Updated 3 years ago
sailik1991 / MarkovGameSolvers
View on GitHub
This is code for finding the minimax/nash/stackelberg strategy of players in Markov Games.
☆28Jun 26, 2025Updated 8 months ago
sudeepkatakol / MARL-CooperativeHunting
View on GitHub
Game Theory Course Project
☆10Dec 7, 2018Updated 7 years ago
tsaoyu / D3HRE
View on GitHub
Data Driven Dynamic Hybrid Renewable Energy design and simulation framework
☆12May 5, 2020Updated 5 years ago
npvoid / OnlineDoubleOracle
View on GitHub
☆11Apr 23, 2021Updated 4 years ago
Kim-Hammar / gym-optimal-intrusion-response
View on GitHub
A Simulated Optimal Intrusion Response Game
☆21Apr 3, 2022Updated 3 years ago
mainongtt / EMNProject
View on GitHub
爱恩斯坦棋代码
☆10Nov 24, 2020Updated 5 years ago
ZhuLinhai1996 / Multi_agent_Reinforcement_Learning
View on GitHub
多代理(Multi agent)强化学习Qlearning算法在多目标探测问题(任务分配+功率优化)中的应用
☆30May 22, 2019Updated 6 years ago
JHD1204 / RL_vehicle_control
View on GitHub
Using reinforcement learning for vehicle lateral control, the input is information such as vehicle status and tracking error, and the out…
☆14May 6, 2023Updated 2 years ago
PromethiumL / JustChord
View on GitHub
A Read-time MIDI visualization tool using PyQt
☆10Nov 24, 2020Updated 5 years ago
aalu1418 / differential-games
View on GitHub
Python implementation of differential games - starting from simple 2 body pursuit/evader problems to more advanced scenarios.
☆37Apr 19, 2020Updated 5 years ago
sailik1991 / StackelbergEquilibribumSolvers
View on GitHub
Solves a Mixed Integer Linear Program to generate the Stacklberg Equilibrium of a General-sum (+Bayesian) Games.
☆36Jan 9, 2026Updated 2 months ago
Say-Hello2y / MultiAgentSystem
View on GitHub
针对基本的一阶二阶多智能体控制，给出了基本的Matlab仿真
☆48Jul 5, 2022Updated 3 years ago
cppbear / EinsteinChess_UCT
View on GitHub
NJU程设实验项目三：爱因斯坦棋
☆10May 24, 2019Updated 6 years ago
JianmingGuo / Sicsp_ICS
View on GitHub
improve mulval to accommodate some updates and make it more suitable for industrial control network
☆12Nov 22, 2022Updated 3 years ago
sguerin13 / CNN_2D_CFD_ECE_228
View on GitHub
Predicting 2D Steady State Fluid Flow Fields using Convolutional Neural Networks
☆11Oct 3, 2020Updated 5 years ago
STOL-AMS / TO-22-Merge-Coordination
View on GitHub
Merge coordination aims to minimize the negative impacts of the merging process on the target lane. The shockwave magnitude and duration …
☆10Sep 21, 2020Updated 5 years ago
Xploror / GNC_LQG_missile_strike
View on GitHub
This project is about implementing an LQG algorithm on a missile whose state model is partially unstable and implementing Guidance Naviga…
☆10Jul 26, 2021Updated 4 years ago
GeoGroup / VoroRock
View on GitHub
Generation of columnar jointed rock using Voronoi method
☆10Dec 13, 2019Updated 6 years ago
ordovas / dice-scores-recognition
View on GitHub
Dice Scores Recognition in images and live video using CNN.
☆13Dec 19, 2020Updated 5 years ago
Zhao-Jichao / MAS_CooperativeClusterMotionControl
View on GitHub
《多智能体系统的协同群集运动控制》-陈杰
☆42Mar 20, 2021Updated 4 years ago
indylab / nxdo
View on GitHub
Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games
☆40Aug 27, 2021Updated 4 years ago
usman15997 / RL-controlled-Lights-and-I2V-SUMO
View on GitHub
This is our Final Year Project in Bachelors. We try to avoid congestion on two levels i.e Intersection level and Infrastructure to Vehicl…
☆10Oct 12, 2020Updated 5 years ago
pkufzh / NWPU_Aircraft_Engineering_Courses_Term_1_6
View on GitHub
NWPU飞行器设计与工程本科专业课（大一上）
☆13Jul 11, 2022Updated 3 years ago
Yuning-J / NVDFeatureAnalysis
View on GitHub
Correlate NVD datasets wIth CWE/CAPEC/CVSS labels for customised usage. Plus static analysis and data visualisation.
☆13Nov 17, 2023Updated 2 years ago
pranurs / fractal-dimensions
View on GitHub
Estimating the fractal dimension of an image using the box counting approach
☆14Oct 15, 2020Updated 5 years ago
sina33 / heft
View on GitHub
HEFT and CPOP task scheduling algorithms
☆12Dec 6, 2018Updated 7 years ago
Lupovis / DetectingCanaryTokens
View on GitHub
A Red Team Script to Detect Canary Tokens and Seed Files
☆15Jan 2, 2024Updated 2 years ago
sastejugaad / brahmos
View on GitHub
3d printed Model of fastest supersonic cruise missile in the world BraHmos. The BrahMos is a medium-range ramjet supersonic cruise missil…
☆14Feb 1, 2023Updated 3 years ago
apauna / RASSH
View on GitHub
RASSH – Reinforced Adaptive SSH Honeypot This is a project developed for my Phd Thesis and the target is to create an Adaptive Honeypot…
☆12Jul 29, 2019Updated 6 years ago
killian-34 / RobustRMAB
View on GitHub
Deep reinforcement learning + double oracle framework for Robust Restless Bandits
☆10Jul 4, 2021Updated 4 years ago
540co / dod-president-budget-procurement-rdte-data
View on GitHub
Data extract of the DoD Procurement (P-1) and RDTE (R-1) justification book exhibits submitted by the US DoD Military Departments and Def…
☆13Jan 3, 2019Updated 7 years ago
zuzhaoye / cellular-automata-traffic-flow-framework
View on GitHub
Cellular automata traffic simulation
☆11Jan 18, 2021Updated 5 years ago
ProSeCo-Planning / ros_proseco_planning
View on GitHub
The ROS interface as well as the Python packages for ProSeCo Planning
☆10Jun 17, 2024Updated last year
IGITUGraz / SparseAdversarialTraining
View on GitHub
Code for "Training Adversarially Robust Sparse Networks via Bayesian Connectivity Sampling" [ICML 2021]
☆10Mar 14, 2022Updated 3 years ago
gdmnl / SCARA-PPR
View on GitHub
The original code for SCARA: Scalable Graph Neural Networks with Feature-Oriented Optimization (VLDB 2022) and Scalable Decoupling Graph …
☆13Mar 8, 2024Updated 2 years ago
lazyZhou1997 / DreamLand
View on GitHub
微软创新杯参赛作品，用C#语言，Unity 3D游戏引擎和Vuforia AR引擎制作的一款解密类AR小游戏
☆13Mar 13, 2018Updated 7 years ago
Sadie-Zhao / Zero-Sum-Stochastic-Stackelberg-Games-NeurIPS
View on GitHub
This is the code repository for the paper "Zero-Sum Stochastic Stackelberg Games".
☆16Oct 12, 2022Updated 3 years ago