Deep learning implementations (Asynchronous Deep Q-Learning) of multiple Game Theory algorithms for adversarial learning (WoLF-PHC, GIGA-WoLF, WPL, EMA-QL, PGA-APP)
☆15Sep 19, 2017Updated 8 years ago
Alternatives and similar repositories for Mixed-Policy-Asynchronous-Deep-Q-Learning
Users that are interested in Mixed-Policy-Asynchronous-Deep-Q-Learning are comparing it to the libraries listed below
Sorting:
- Using WoLF (win or learn fast) PHC (policy hill climbing) algorithm to implement stochastic games☆15Jun 14, 2019Updated 6 years ago
- Source code for journal paper "Multiagent Reinforcement Learning With Sparse Interactions by Negotiation and Knowledge Transfer"☆13Dec 26, 2017Updated 8 years ago
- Multi-agent reinforcement learning programs based on Game theory☆42Feb 11, 2023Updated 3 years ago
- 2019 Fall - Game theory and Multi-agent RL Termproject☆10Dec 13, 2019Updated 6 years ago
- Testing different RL algorithms for multi-agent environments. From SARSA, QLearning to Independent Q-Learning, Joint Action Learning and …☆12Mar 29, 2019Updated 6 years ago
- 强化学习中纳什Qlearning 实现矩阵博弈☆30Feb 25, 2019Updated 7 years ago
- Nash Q Learning☆32Nov 23, 2020Updated 5 years ago
- Exploring the Dyna-Q reinforcement learning algorithm☆17Feb 27, 2018Updated 8 years ago
- This is the code provided to support the paper of "Stackelberg Game Theory Based Optimization Model for the Design of Payment Mechanism i…☆26Jul 26, 2019Updated 6 years ago
- ☆30Jun 3, 2022Updated 3 years ago
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆29Dec 26, 2017Updated 8 years ago
- M-CURL: Masked Contrastive Representation Learning for Reinforcement Learning☆28Nov 5, 2020Updated 5 years ago
- original source code of the ASE 2019 paper: Wuji: Automatic Online Combat Game Testing Using Evolutionary Deep Reinforcement Learning☆28Jun 8, 2020Updated 5 years ago
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023☆33Dec 7, 2024Updated last year
- ☆13Dec 13, 2024Updated last year
- 基于Dijkstra算法的武汉地铁路径规划☆10Jul 1, 2022Updated 3 years ago
- 基于RLCard平台的麻将mahjong博弈游戏代码,包括基于规则和基于Dueling DQN的Agent模型。☆32Apr 25, 2022Updated 3 years ago
- ☆14Aug 12, 2024Updated last year
- NuART-Py: Python Library of Adaptive Resonance Theory Neural Network☆10Jan 26, 2020Updated 6 years ago
- A Caffe/C++ implementation of Deep Deterministic Policy Gradient☆10Feb 1, 2019Updated 7 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- Official PyTorch Implementation of Federated Learning with Positive and Unlabeled Data☆10Aug 12, 2022Updated 3 years ago
- Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022)☆10Dec 1, 2022Updated 3 years ago
- Simple Redux like state management library based on RxJs.☆11Dec 14, 2017Updated 8 years ago
- Part of a research scholarship. I built a basic 2d driving sim with simulated lidar data to train Deep Q Neural Network. So far after abo…☆11Feb 15, 2017Updated 9 years ago
- ADAPTIVE RESONANCE THEORY. Gail A. Carpenter and Stephen Grossberg☆10Feb 10, 2015Updated 11 years ago
- Inverse Reinforcement Learning, Inverse Optimal Control, Apprenticeship Learning, Imitation Learning review☆46Apr 27, 2021Updated 4 years ago
- yet another reinforcement learning package☆12May 24, 2022Updated 3 years ago
- PyTorch implementation of the "Learning an Adaptive Learning Rate Schedule" paper found here: https://arxiv.org/abs/1909.09712.☆12Jan 15, 2020Updated 6 years ago
- Code associated with our paper "Estimating Risk and Uncertainty in Reinforcement Learning"☆11Oct 3, 2023Updated 2 years ago
- 使用强化学习算法Q-learning,对3D打印的路径进行规划,减少打印喷头转弯、启停,提高打印效率。☆12Jun 30, 2021Updated 4 years ago
- Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow - Tensorlfow Im…☆13Feb 2, 2019Updated 7 years ago
- ☆10Jul 20, 2020Updated 5 years ago
- Official implementation of Recurrent Action Transformer with Memory, an offline RL agent with memory mechanisms. https://sites.google.com…☆18Nov 23, 2025Updated 3 months ago
- TransMix: Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning☆11Oct 18, 2022Updated 3 years ago
- ☆12Mar 6, 2023Updated 2 years ago
- this is for visual servoing of a turtlebot combined with navigation management☆13Feb 11, 2019Updated 7 years ago
- Google AI Research☆10Mar 11, 2020Updated 5 years ago
- Official implementation for the paper "Sample-Then-Optimize Batch Neural Thompson Sampling", published at NeurIPS 2022.☆10Oct 13, 2022Updated 3 years ago