Implementation for the ICML 2016 paper "Deep reinforcement learning with opponent modeling"
☆71Aug 18, 2016Updated 9 years ago
Alternatives and similar repositories for opponent
Users that are interested in opponent are comparing it to the libraries listed below.
- Code release for Learning with Opponent-Learning Awareness and variations.☆151Apr 13, 2023Updated 2 years ago
- PyTorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆97Aug 21, 2018Updated 7 years ago
- Advanced_Data_Integration_Project☆11Jul 31, 2018Updated 7 years ago
- ☆17Dec 19, 2019Updated 6 years ago
- Official Repository for "Agent Modelling under Partial Observability for Deep Reinforcement Learning"☆43Oct 5, 2022Updated 3 years ago
- Implementation of the Off Belief Learning algorithm.☆49Aug 18, 2022Updated 3 years ago
- Code for Model-Free Opponent Shaping (ICML 2022)☆21Nov 18, 2022Updated 3 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆25Aug 4, 2022Updated 3 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆33Oct 6, 2022Updated 3 years ago
- Design good curriculums for deep reinforcement learning☆14May 18, 2016Updated 9 years ago
- Implementing the Learning with Opponent Learning Awareness paper (https://blog.openai.com/learning-to-model-other-minds/)☆19Jul 20, 2018Updated 7 years ago
- Implementation of the DreamerV2 agent in torch☆20Sep 4, 2022Updated 3 years ago
- ☆33Nov 21, 2022Updated 3 years ago
- PyTorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).☆21Jan 15, 2020Updated 6 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆11Jul 4, 2018Updated 7 years ago
- Half Field Offense in Robocup 2D Soccer☆236Aug 31, 2022Updated 3 years ago
- ☆123Feb 16, 2023Updated 3 years ago
- Code for ICLR 2019 paper: Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks☆229Oct 3, 2023Updated 2 years ago
- Code for the paper "Emergent Complexity via Multi-agent Competition"☆836Apr 2, 2023Updated 3 years ago
- ☆11Mar 18, 2021Updated 5 years ago
- ☆18Apr 17, 2019Updated 6 years ago
- ☆10Apr 23, 2021Updated 4 years ago
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆24Feb 27, 2022Updated 4 years ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …☆121Nov 4, 2024Updated last year
- Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning☆20Aug 12, 2021Updated 4 years ago
- Reinforcement Learning from Hierarchical Critics☆14Jul 30, 2020Updated 5 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- RoboCup Keepaway benchmark player framework.☆26Nov 3, 2014Updated 11 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆55Aug 30, 2024Updated last year
- Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"☆2,748Apr 9, 2024Updated 2 years ago
- Malmo Challenge☆69May 29, 2018Updated 7 years ago
- Teacher-Student Curriculum Learning code☆85Nov 24, 2017Updated 8 years ago
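- Companion code for the book "Multi-Agent Real-Time Strategy Confrontation: Methods and Practice" (多智能体即时策略对抗方法与实践), edited by Su Jiongming, Liu Hongfu, Chen Shaofei, and Xiang Fengtao, Science Press, November 2019☆31Nov 17, 2020Updated 5 years ago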
- (TG'2021) Code for paper "Efficient Reinforcement Learning for StarCraft by Abstract Forward Models and Transfer Learning". TG = Transact…☆10Apr 24, 2023Updated 2 years ago
- ☆108Feb 10, 2021Updated 5 years ago
- Environments with IC3Net paper☆15Jan 8, 2019Updated 7 years ago
- ☆57May 5, 2018Updated 7 years ago
- This repository is the official implementation of ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination. P…☆55Nov 22, 2025Updated 4 months ago
- Inferring beliefs about dynamics from behavior☆30May 24, 2018Updated 7 years ago