AnyLeoPeace / DURLECA
The released code for DUal-objective Reinforcement-Learning Epidemic Control Agent.
☆29Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for DURLECA
- ☆12Updated 4 months ago
- Paper list of multi-agent reinforcement learning (MARL)☆25Updated 3 years ago
- The submission template for the Learning to Dispatch and Reposition Competition @ KDD2020.☆85Updated 3 years ago
- ☆26Updated 4 years ago
- References at the Intersection of Causality and Reinforcement Learning☆88Updated 4 years ago
- A Spatio-temporal point process simulator.☆45Updated last year
- Multi Type Mean Field Reinforcement Learning☆30Updated 2 years ago
- Identify the causality for air pollution in China (North China, Yangtze River Delta, and Pearl River Delta)☆15Updated 4 years ago
- Efficient Exploration through Bayesian Deep Q-Networks☆36Updated 6 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 4 years ago
- ☆43Updated 2 years ago
- ☆97Updated 3 years ago
- reproduce some RL or Multi-Agent models☆35Updated 5 years ago
- ☆77Updated 3 years ago
- Transportation data online prediction☆46Updated 3 years ago
- Official implementation of Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning at ICML…☆37Updated 3 years ago
- Experiments on a discrete mean field game model of population dynamics with reinforcement learning☆31Updated last year
- Paper Collection of Reinforcement Learning Exploration covers Exploration of Muti-Arm-Bandit, Reinforcement Learning and Multi-agent Rein…☆34Updated 5 years ago
- A general framework for learning spatio-temporal point processes via reinforcement learning☆28Updated 3 years ago
- Implementation of Deepmind's LaserTag-v0 game in A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning(2017)☆18Updated 5 years ago
- Learning Individual Intrinsic Reward in MARL☆62Updated last year
- A Multi-agent Learning Framework☆62Updated 3 years ago
- Deconfounding Reinforcement Learning in Observational Settings☆48Updated 5 years ago
- FEN Code☆37Updated 5 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆61Updated 3 years ago
- Bayesian Optimization Meets Bayesian Optimal Stopping☆30Updated 4 years ago
- ☆40Updated 2 years ago
- ☆25Updated 3 years ago
- ☆23Updated 3 years ago
- Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatib…☆63Updated last year