☆10Apr 23, 2021Updated 5 years ago
Alternatives and similar repositories for OnlineDoubleOracle
Users that are interested in OnlineDoubleOracle are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- ☆22May 20, 2021Updated 4 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆18Mar 2, 2021Updated 5 years ago
- ☆12Jan 30, 2021Updated 5 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆56Aug 30, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- More efficient exploration for reinforcement learning in two-player, zero-sum game☆21Jul 30, 2024Updated last year
- Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…☆11Dec 1, 2022Updated 3 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆28Nov 19, 2021Updated 4 years ago
- Implementation of the Off Belief Learning algorithm.☆49Aug 18, 2022Updated 3 years ago
- Demo for Plan Recognition as Planning over Classical Action Theories☆12Dec 29, 2016Updated 9 years ago
- ☆16Feb 23, 2024Updated 2 years ago
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆24Feb 27, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆12Jun 17, 2022Updated 3 years ago
- ☆16Jul 13, 2022Updated 3 years ago
- ☆30Aug 20, 2021Updated 4 years ago
- (AAAI24 oral) Implementation of RPPO(Risk-sensitive PPO) and RPBT(Population-based self-play with RPPO)☆12May 22, 2023Updated 2 years ago
- GAIL learning to imitate PPO playing CartPole.☆13May 27, 2021Updated 4 years ago
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆15Apr 25, 2024Updated 2 years ago
- Kuhn poker implemented in accordance to OpenAI gym interface☆14Dec 5, 2019Updated 6 years ago
- ☆18Nov 16, 2020Updated 5 years ago
- ☆16Mar 24, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 强化学习中纳什Qlearning 实 现矩阵博弈☆30Feb 25, 2019Updated 7 years ago
- ☆10Apr 26, 2023Updated 3 years ago
- Code for "Training Adversarially Robust Sparse Networks via Bayesian Connectivity Sampling" [ICML 2021]☆10Mar 14, 2022Updated 4 years ago
- Multi-task Multi-agent Soft Actor Critic for SMAC☆15Jan 18, 2022Updated 4 years ago
- Python implementations of counterfactual regret minimization exercises found here: http://modelai.gettysburg.edu/2013/cfr/☆10Apr 27, 2017Updated 9 years ago
- Source code for the Joint Shapley values: a measure of joint feature importance☆12Sep 14, 2021Updated 4 years ago
- using information theory to encourage agents to cooperate and compete☆19Oct 4, 2018Updated 7 years ago
- ☆16Oct 6, 2019Updated 6 years ago
- Data Driven Dynamic Hybrid Renewable Energy design and simulation framework☆12May 5, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [ICLR 2023] PyTorch code for DFPC: Data flow driven pruning of coupled channels without data.☆15Aug 25, 2023Updated 2 years ago
- 亚马逊棋冠军程序细节☆13Jan 7, 2026Updated 3 months ago
- ☆10Oct 10, 2018Updated 7 years ago
- Using Natural Language for Reward Shaping in Reinforcement Learning☆24Dec 11, 2023Updated 2 years ago
- 微软创新杯参赛作品,用C#语言,Unity 3D游戏引擎和Vuforia AR引擎制作的一款解密类AR小游戏☆13Mar 13, 2018Updated 8 years ago
- Data extract of the DoD Procurement (P-1) and RDTE (R-1) justification book exhibits submitted by the US DoD Military Departments and Def…☆13Jan 3, 2019Updated 7 years ago
- Code for "AutoCFR: Learning to Design Counterfatual Regret Minimization Algorithms", AAAI 2022 (Oral)☆22Apr 22, 2024Updated 2 years ago