levelupai / rl-slg
Reinforcement learning training project for an SLG game
☆12 Updated 7 years ago
Alternatives and similar repositories for rl-slg:
Users who are interested in rl-slg are comparing it to the libraries listed below.
- Source code for the paper "Divergence-Augmented Policy Optimization" ☆37 Updated 5 years ago
- ☆4 Updated 3 months ago
- Fictitious Self-play & Reinforcement Learning ☆18 Updated 7 years ago
- Environments with IC3Net paper ☆12 Updated 6 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016) ☆46 Updated 6 years ago
- ☆18 Updated 5 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO) ☆23 Updated 2 years ago
- ☆33 Updated 7 years ago
- ☆30 Updated 2 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU) ☆20 Updated 2 years ago
- A platform for intelligent agent learning based on a 3D open-world FPS game developed by Inspir.AI. ☆57 Updated 2 years ago
- ☆12 Updated 2 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient) ☆43 Updated 2 years ago
- ☆17 Updated 2 years ago
- Reinforcement Learning and Transfer Learning based StarCraft Micromanagement ☆45 Updated 7 years ago
- Distributed Deep Reinforcement Learning ☆29 Updated 4 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala) ☆26 Updated 2 years ago
- A simple framework for distributed reinforcement learning in PyTorch. ☆16 Updated 4 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019. ☆17 Updated 5 years ago
- RLA is a tool for managing your RL experiments automatically ☆71 Updated 2 years ago
- ☆42 Updated 3 years ago
- Code for NeurIPS 2021 paper "Offline Reinforcement Learning with Reverse Model-based Imagination" ☆19 Updated 3 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t… ☆44 Updated 4 years ago
- Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty ☆56 Updated 6 years ago
- ☆15 Updated 4 years ago
- ☆18 Updated 3 years ago
- Neural Fictitious Self-Play in Leduc Holdem ☆11 Updated 6 years ago
- soft q learning and soft actor critic ☆15 Updated 6 years ago
- original source code of the ASE 2019 paper: Wuji: Automatic Online Combat Game Testing Using Evolutionary Deep Reinforcement Learning ☆27 Updated 4 years ago
- ☆25 Updated 2 years ago