Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games. Zihan Ding, Dijia Su, Qinghua Liu, Chi Jin
☆22Aug 26, 2022Updated 3 years ago
Alternatives and similar repositories for nash-dqn
Users that are interested in nash-dqn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆51Mar 8, 2024Updated 2 years ago
- ☆22May 20, 2021Updated 5 years ago
- Highway-Env Agent using DQN☆19May 29, 2022Updated 4 years ago
- ☆16Jul 13, 2022Updated 3 years ago
- Computation offloading is a technique to circumvent device restrictions and bring novel, computationally-intensive applications to a hete…☆22Jun 9, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A deep reinforcement learning multi-agent algorithm, where a team learns to complete a task and communicate between agents.☆16Jun 1, 2021Updated 5 years ago
- Code for Model-Free Opponent Shaping (ICML 2022)☆24Nov 18, 2022Updated 3 years ago
- SYMBXRL: Symbolic Explainable Deep Reinforcement Learning for Mobile Networks☆22Apr 1, 2026Updated 2 months ago
- ☆13Oct 11, 2022Updated 3 years ago
- Multi-Agent Reinforcement Learning (MARL) method to learn scalable control polices for multi-agent target tracking.☆42Aug 30, 2023Updated 2 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- A custom interior point solver for mixed complementarity problems.☆20Apr 20, 2026Updated last month
- A neural network accelerated solver for mixed-strategy solutions of trajectory games. Do you even lift?☆18Jun 22, 2025Updated 11 months ago
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game☆55Apr 21, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆46Oct 29, 2020Updated 5 years ago
- Code accompanying paper "Models as Agents: Optimizing Multi-Step Predictions of Interactive Local Models in Model-Based Multi-Agent Reinf…☆15Dec 2, 2023Updated 2 years ago
- Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning☆11Jun 16, 2022Updated 4 years ago
- The codes are to solve related problems based on E-CARGO, Group Role Assignment with Constraints.☆24Aug 21, 2022Updated 3 years ago
- DtnSim: A Python-based simulator for Delay Tolerant Networking☆16Nov 11, 2022Updated 3 years ago
- AlphaHydrogen is an open source OpenAI Gym environment that simulates the energy system of a residential community with distributed renew…☆17Oct 5, 2021Updated 4 years ago
- Code related to the Neural-Swarm (ICRA 2020, Journal) papers☆29Mar 23, 2022Updated 4 years ago
- General Board Game Playing☆25Jun 16, 2025Updated last year
- ☆10Apr 13, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Code repository for the NAACL 2025 paper "LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language…☆46Oct 13, 2024Updated last year
- PyTorch Implementation of COPA for coordinating teams that can dynamically change.☆23Apr 16, 2022Updated 4 years ago
- 本项目展示了2022年部分信息检索/数据挖掘顶会论文分类。☆17Jun 13, 2022Updated 4 years ago
- Simulate one server for one user, use PPO.☆15Nov 21, 2021Updated 4 years ago
- ☆19Jan 8, 2025Updated last year
- ☆25Feb 21, 2022Updated 4 years ago
- Asymmetric methods for partially observable reinforcement learning☆10Jun 9, 2025Updated last year
- An unofficial implementation for online decision transformer☆41Sep 20, 2022Updated 3 years ago
- Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)☆12Jul 29, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆30Aug 20, 2021Updated 4 years ago
- ☆28Jul 15, 2022Updated 3 years ago
- A tool for experimenting with evolutionary optimization methods for machine learning algorithms, by distributing the workload over a larg…☆14Dec 19, 2018Updated 7 years ago
- multi-agent pathfinding via dqn☆16May 19, 2021Updated 5 years ago
- Unofficial minimal implementation of consistency models (CM) proposed by Song et al. 2023 on a 1D toy task in pytorch☆21May 2, 2023Updated 3 years ago
- Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.☆13Jul 19, 2024Updated last year
- Federated Learning Experiments for Remote Sensing image data using convolution neural networks☆16Aug 5, 2021Updated 4 years ago