levelupai / rl-slg

Reinforcement learning training project for a SLG game

☆12

Alternatives and similar repositories for rl-slg:

Users that are interested in rl-slg are comparing it to the libraries listed below

lns / dapo
Source code for the paper "Divergence-Augmented Policy Optimization"
☆37Updated 5 years ago
tencent-ailab / TLeague
☆4Updated 3 months ago
ShibiHe / Poker-Fictitious-Play
Fictitious Self-play & Reinforcement Learning
☆18Updated 7 years ago
apsdehal / ic3net-envs
Environments with IC3Net paper
☆12Updated 6 years ago
younggyoseo / pytorch-nfsp
Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)
☆46Updated 6 years ago
lns / memoire
☆18Updated 5 years ago
rommeoijcai2019 / rommeo
IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)
☆23Updated 2 years ago
wwxFromTju / DRL_trick
☆33Updated 7 years ago
wangyuhuix / TrulyPPO
☆30Updated 2 years ago
YeTianJHU / GSCU
Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)
☆20Updated 2 years ago
inspirai / wilderness-scavenger
A platform for intelligent agent learning based on a 3D open-world FPS game developed by Inspir.AI.
☆57Updated 2 years ago
jidiai / Competition_Football
☆12Updated 2 years ago
staghuntrpg / RPG
This is the source code of RPG (Reward-Randomized Policy Gradient)
☆43Updated 2 years ago
lanyavik / BAIL
☆17Updated 2 years ago
nanxintin / StarCraft-AI
Reinforcement Learning and Transfer Learning based StarCraft Micromanagement
☆45Updated 7 years ago
createamind / Distributed-DRL
Distributed Deep Reinforcement Learning
☆29Updated 4 years ago
seolhokim / DistributedRL-Pytorch-Ray
Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)
☆26Updated 2 years ago
toshikwa / rltorch
A simple framework for distributed reinforcement learning in PyTorch.
☆16Updated 4 years ago
suyoung-lee / Episodic-Backward-Update
Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.
☆17Updated 5 years ago
xionghuichen / RLAssistant
RLA is a tool for managing your RL experiments automatically
☆71Updated 2 years ago
davidbrandfonbrener / onestep-rl
☆42Updated 3 years ago
wenzhe-li / romi
Code for NeurIPS 2021 paper "Offline Reinforcement Learning with Reverse Model-based Imagination"
☆19Updated 3 years ago
jparkerholder / DvD_ES
Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…
☆44Updated 4 years ago
ChengTsang / PPO-clip-and-PPO-penalty-on-Atari-Domain
Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty
☆56Updated 6 years ago
jiangsy / slbo_pytorch
☆15Updated 4 years ago
diversepsro / diverse_psro
☆18Updated 3 years ago
Johannes-H / nfsp-leduc
Neural Fictitious Self-Play in Leduc Holdem
☆11Updated 6 years ago
LihaoR / Entropy-Regularized-RL
soft q learning and soft actor critic
☆15Updated 6 years ago
NeteaseFuxiRL / wuji
original source code of the ASE 2019 paper: Wuji: Automatic Online Combat Game Testing Using Evolutionary Deep Reinforcement Learning
☆27Updated 4 years ago
jidiai / Competition_Olympics-Integrated
☆25Updated 2 years ago