(NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.
☆28Nov 19, 2021Updated 4 years ago
Alternatives and similar repositories for NAC
Users that are interested in NAC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆22May 20, 2021Updated 4 years ago
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆24Feb 27, 2022Updated 4 years ago
- ☆10Apr 23, 2021Updated 4 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021☆13Nov 3, 2021Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆55Aug 30, 2024Updated last year
- Implementation of the Off Belief Learning algorithm.☆49Aug 18, 2022Updated 3 years ago
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆15Apr 25, 2024Updated last year
- Code for abstracting, evaluating, and visualizing Markov Decision Processes.☆10Jan 12, 2017Updated 9 years ago
- A collection of deep reinforcement learning algorithm implementations☆11Jan 9, 2020Updated 6 years ago
- Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…☆11Dec 1, 2022Updated 3 years ago
- ☆12Apr 17, 2024Updated 2 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Feb 14, 2019Updated 7 years ago
- A System-Oriented Wargame Framework for Adversarial ML☆10Apr 24, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆38Mar 28, 2022Updated 4 years ago
- Efficient numpy-like ragged array datatype for Python.☆20May 2, 2023Updated 2 years ago
- This is the numerical approach proposed in the paper "Optimal Incentives to Mitigate Epidemics: A Stackelberg Mean Field Game Approach" b…☆13Nov 22, 2021Updated 4 years ago
- Code for "Joint Policy Search for Collaborative Multi-agent Incomplete Information Games"☆52Nov 14, 2023Updated 2 years ago
- ☆11Nov 29, 2021Updated 4 years ago
- Code for NeurIPS paper "Self-Organized Group for Cooperative Multi-agentReinforcement Learning".☆22Feb 20, 2023Updated 3 years ago
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆66May 22, 2021Updated 4 years ago
- (NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling☆136Oct 26, 2023Updated 2 years ago
- ☆30Aug 20, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning☆63May 20, 2020Updated 5 years ago
- ☆117Apr 15, 2023Updated 3 years ago
- Adaptable Agent Populations via a Generative Model of Policies☆12Oct 14, 2021Updated 4 years ago
- established for the data normalization and reinforcement learning training scheme to train an agent in DCS world☆12Oct 22, 2021Updated 4 years ago
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆49Mar 8, 2024Updated 2 years ago
- A parallel framework for population-based multi-agent reinforcement learning.☆551Dec 14, 2023Updated 2 years ago
- 数据科学与人工智能中文讲义☆14Apr 6, 2026Updated last week
- ☆12Aug 28, 2020Updated 5 years ago
- ☆16Mar 24, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for "AutoCFR: Learning to Design Counterfatual Regret Minimization Algorithms", AAAI 2022 (Oral)☆22Apr 22, 2024Updated last year
- Sccheduling Environment for Multi-Robot Coordination Problems☆18May 9, 2022Updated 3 years ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆74Oct 18, 2022Updated 3 years ago
- Certifying Some Distributional Robustness with Principled Adversarial Training (https://arxiv.org/abs/1710.10571)☆45May 1, 2018Updated 7 years ago
- DepTrim automatically specializes the software supply chain of dependencies in Maven projects https://arxiv.org/pdf/2302.08370☆15Updated this week
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 3 years ago
- PyTorch Implementation of COPA for coordinating teams that can dynamically change.☆23Apr 16, 2022Updated 4 years ago