ajkeith / Cyber-Air-Defense
Counterfactual regret minimization for multi-domain operations
☆8Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for Cyber-Air-Defense
- Code for "AutoCFR: Learning to Design Counterfatual Regret Minimization Algorithms", AAAI 2022 (Oral)☆16Updated 6 months ago
- Code for magnetic mirror descent.☆15Updated last year
- APAC-Net☆6Updated 2 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆36Updated 3 years ago
- 3d manuver decision in air combat situations☆14Updated 2 years ago
- ☆17Updated last year
- ☆12Updated 2 years ago
- Code exploring the use of reward machines in the context of cooperative multi-agent reinforcement learning.☆13Updated last year
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆46Updated 2 months ago
- jinxinglim / Game-Theoretical-Approaches-in-Multi-Agent-Reinforcement-Learning-Policy-Space-Response-Oracles☆14Updated 5 years ago
- Online solver based on Monte Carlo tree search for POMDPs with continuous state, action, and observation spaces.☆54Updated 5 months ago
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆19Updated 2 years ago
- A deep reinforcement learning multi-agent algorithm, where a team learns to complete a task and communicate between agents.☆16Updated 3 years ago
- A NLTH Poker Agent using Monte-Carlo-Simulation☆12Updated 4 years ago
- ☆13Updated 2 years ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆27Updated 3 years ago
- ☆18Updated 3 years ago
- ☆9Updated 5 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆45Updated 5 years ago
- Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment☆15Updated last year
- A Neural Network Approach for Real-Time High-Dimensional Optimal Control☆25Updated 2 years ago
- Rapidly designing and solving differential games in Julia.☆35Updated 3 weeks ago
- FireCommander2020: A Multiagent, Interactive Joint Perception-Action Reconnaissance Environment☆15Updated 2 years ago
- A collection of environments and reference agents for planning and reinforcement learning research in partially observable, multi-agent …☆17Updated last week
- Implementation of RPPO(Risk-sensitive PPO) and RPBT(Population-based self-play with RPPO)☆10Updated last year
- ☆11Updated 2 years ago
- Scalable Implementation of Neural Fictitous Self-Play☆73Updated 5 years ago
- Revisiting Discrete Gradient Estimation in MADDPG☆23Updated last year
- ♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation☆10Updated 3 years ago
- This code is based on the implementation of http://www.cs.cmu.edu/afs/cs/Web/People/sandholm/potential-aware_imperfect-recall.aaai14.pdf,…☆34Updated 5 years ago