Official code for Nash-DQN, an algorithm for two-player zero-sum Markov games. For details, see the paper: A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games, by Zihan Ding, Dijia Su, Qinghua Liu, and Chi Jin.
☆20 · Aug 26, 2022 · Updated 3 years ago
Alternatives and similar repositories for nash-dqn
Users that are interested in nash-dqn are comparing it to the libraries listed below.
- Deep Reinforcement Learning for Nash Equilibria ☆45 · Oct 25, 2022 · Updated 3 years ago
- ☆22 · May 20, 2021 · Updated 4 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU) ☆25 · Aug 4, 2022 · Updated 3 years ago
- Highway-Env Agent using DQN ☆19 · May 29, 2022 · Updated 3 years ago
- A System-Oriented Wargame Framework for Adversarial ML ☆10 · Apr 24, 2023 · Updated 2 years ago
- ☆16 · Jul 13, 2022 · Updated 3 years ago
- Computation offloading is a technique to circumvent device restrictions and bring novel, computationally-intensive applications to a hete… ☆22 · Feb 4, 2026 · Updated 2 months ago
- A deep reinforcement learning multi-agent algorithm, where a team learns to complete a task and communicate between agents. ☆16 · Jun 1, 2021 · Updated 4 years ago
- SYMBXRL: Symbolic Explainable Deep Reinforcement Learning for Mobile Networks ☆21 · Apr 1, 2026 · Updated 2 weeks ago
- ☆13 · Oct 11, 2022 · Updated 3 years ago
- Multi-Agent Reinforcement Learning (MARL) method to learn scalable control policies for multi-agent target tracking. ☆42 · Aug 30, 2023 · Updated 2 years ago
- Core interface to design, solve, and simulate trajectory games. ☆21 · Dec 6, 2024 · Updated last year
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games ☆40 · Aug 27, 2021 · Updated 4 years ago
- A custom interior point solver for mixed complementarity problems. ☆18 · Mar 9, 2026 · Updated last month
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game ☆54 · Apr 21, 2023 · Updated 2 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t… ☆46 · Oct 29, 2020 · Updated 5 years ago
- Co-evolution of agents and environments in GVG-AI ☆17 · Aug 12, 2021 · Updated 4 years ago
- ☆12 · Aug 12, 2022 · Updated 3 years ago
- ☆18 · Jul 10, 2022 · Updated 3 years ago
- ☆16 · Mar 24, 2023 · Updated 3 years ago
- Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning ☆11 · Jun 16, 2022 · Updated 3 years ago
- DtnSim: A Python-based simulator for Delay Tolerant Networking ☆16 · Nov 11, 2022 · Updated 3 years ago
- AlphaHydrogen is an open source OpenAI Gym environment that simulates the energy system of a residential community with distributed renew… ☆17 · Oct 5, 2021 · Updated 4 years ago
- Code related to the Neural-Swarm (ICRA 2020, Journal) papers ☆29 · Mar 23, 2022 · Updated 4 years ago
- General Board Game Playing ☆25 · Jun 16, 2025 · Updated 10 months ago
- ☆13 · Jan 26, 2023 · Updated 3 years ago
- Multi-agent occlusion inference using observed driver behaviors. A driver sensor model is learned using a conditional variational autoenc… ☆23 · Apr 7, 2022 · Updated 4 years ago
- PyTorch Implementation of COPA for coordinating teams that can dynamically change. ☆23 · Apr 16, 2022 · Updated 4 years ago
- ☆25 · Jan 13, 2022 · Updated 4 years ago
- Simulate one server for one user, use PPO. ☆15 · Nov 21, 2021 · Updated 4 years ago
- ☆19 · Jan 8, 2025 · Updated last year
- ☆25 · Jul 15, 2022 · Updated 3 years ago
- ☆25 · Feb 21, 2022 · Updated 4 years ago
- Asymmetric methods for partially observable reinforcement learning ☆10 · Jun 9, 2025 · Updated 10 months ago
- An unofficial implementation for online decision transformer ☆41 · Sep 20, 2022 · Updated 3 years ago
- Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios) ☆12 · Jul 29, 2023 · Updated 2 years ago
- ☆30 · Aug 20, 2021 · Updated 4 years ago
- An open source benchmark for Multi Agent Reinforcement Learning ☆31 · Jul 15, 2023 · Updated 2 years ago
- A tool for experimenting with evolutionary optimization methods for machine learning algorithms, by distributing the workload over a larg… ☆14 · Dec 19, 2018 · Updated 7 years ago