Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games. Zihan Ding, Dijia Su, Qinghua Liu, Chi Jin
☆21Aug 26, 2022Updated 3 years ago
Alternatives and similar repositories for nash-dqn
Users that are interested in nash-dqn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆49Mar 8, 2024Updated 2 years ago
- Deep Reinforcement Learning for Nash Equilibria☆45Oct 25, 2022Updated 3 years ago
- Highway-Env Agent using DQN☆19May 29, 2022Updated 4 years ago
- A System-Oriented Wargame Framework for Adversarial ML☆10Apr 24, 2023Updated 3 years ago
- ☆16Jul 13, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Computation offloading is a technique to circumvent device restrictions and bring novel, computationally-intensive applications to a hete…☆22Feb 4, 2026Updated 3 months ago
- A deep reinforcement learning multi-agent algorithm, where a team learns to complete a task and communicate between agents.☆16Jun 1, 2021Updated 4 years ago
- Code for Model-Free Opponent Shaping (ICML 2022)☆23Nov 18, 2022Updated 3 years ago
- SYMBXRL: Symbolic Explainable Deep Reinforcement Learning for Mobile Networks☆22Apr 1, 2026Updated last month
- ☆13Oct 11, 2022Updated 3 years ago
- Multi-Agent Reinforcement Learning (MARL) method to learn scalable control polices for multi-agent target tracking.☆42Aug 30, 2023Updated 2 years ago
- Core interface to design, solve, and simulate trajectory games.☆21Dec 6, 2024Updated last year
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- A custom interior point solver for mixed complementarity problems.☆19Apr 20, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A neural network accelerated solver for mixed-strategy solutions of trajectory games. Do you even lift?☆18Jun 22, 2025Updated 11 months ago
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game☆55Apr 21, 2023Updated 3 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆46Oct 29, 2020Updated 5 years ago
- Co-evolution of agents and environments in GVG-AI☆17Aug 12, 2021Updated 4 years ago
- Code accompanying paper "Models as Agents: Optimizing Multi-Step Predictions of Interactive Local Models in Model-Based Multi-Agent Reinf…☆15Dec 2, 2023Updated 2 years ago
- ☆12Aug 12, 2022Updated 3 years ago
- ☆18Jul 10, 2022Updated 3 years ago
- ☆16Mar 24, 2023Updated 3 years ago
- Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning☆11Jun 16, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- The codes are to solve related problems based on E-CARGO, Group Role Assignment with Constraints.☆24Aug 21, 2022Updated 3 years ago
- DtnSim: A Python-based simulator for Delay Tolerant Networking☆16Nov 11, 2022Updated 3 years ago
- STL源码剖析学习笔记☆11Jun 28, 2022Updated 3 years ago
- Code related to the Neural-Swarm (ICRA 2020, Journal) papers☆29Mar 23, 2022Updated 4 years ago
- General Board Game Playing☆25Jun 16, 2025Updated 11 months ago
- Multi-agent occlusion inference using observed driver behaviors. A driver sensor model is learned using a conditional variational autoenc…☆24Apr 7, 2022Updated 4 years ago
- PyTorch Implementation of COPA for coordinating teams that can dynamically change.☆23Apr 16, 2022Updated 4 years ago
- ☆25Jan 13, 2022Updated 4 years ago
- Simulate one server for one user, use PPO.☆15Nov 21, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆19Jan 8, 2025Updated last year
- ☆25Feb 21, 2022Updated 4 years ago
- Asymmetric methods for partially observable reinforcement learning☆10Jun 9, 2025Updated 11 months ago
- An unofficial implementation for online decision transformer☆41Sep 20, 2022Updated 3 years ago
- Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)☆12Jul 29, 2023Updated 2 years ago
- ☆30Aug 20, 2021Updated 4 years ago
- An open source benchmark for Multi Agent Reinforcement Learning☆31Jul 15, 2023Updated 2 years ago