FrostHan / vlog
Variational oracle guiding for reinforcement learning
☆12Updated 3 years ago
Alternatives and similar repositories for vlog
Users who are interested in vlog are comparing it to the libraries listed below
- ☆22Updated 3 years ago
- Japanese Riichi Mahjong written in C++.☆117Updated 7 months ago
- AI for three-player mahjong☆17Updated last year
- ☆13Updated 3 years ago
- Deep reinforcement learning of mahjong self-play☆17Updated 7 years ago
- Counterfactual regret minimization for multi-domain operations☆8Updated 4 years ago
- Utility tools for tenhou.net log☆30Updated last year
- User Interface of Mahjong AI☆17Updated 3 years ago
- Game server for Japanese Riichi mahjong☆17Updated 2 years ago
- ♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation☆10Updated 4 years ago
- Code for "AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms", AAAI 2022 (Oral)☆20Updated last year
- Riichi Mahjong Kit: (1) Game log crawler (sqlite3, json, bs4); (2) Game log preprocessor; (3) Deterministic algorithms library☆51Updated 6 years ago
- ☆11Updated 3 years ago
- Artificial Intelligence for Japanese mahjong☆270Updated 3 years ago
- Deep reinforcement learning using Phasic Policy Gradient in PyTorch & TensorFlow☆20Updated 3 years ago
- Scripts for downloading logs from tenhou.net☆57Updated 9 years ago
- C++ implementations of Counterfactual Regret Minimization and Monte Carlo CFR☆75Updated 3 years ago
- This is the code for Q-value Path Decomposition for Deep Multiagent Reinforcement Learning (NeurIPS 2019).☆11Updated 6 years ago
- ☆16Updated 2 years ago
- A deep reinforcement learning multi-agent algorithm, where a team learns to complete a task and communicate between agents.☆16Updated 4 years ago
- Reinforcement learning (RL) implementation of the imperfect-information game Mahjong using Markov decision processes to predict future game s…☆91Updated 2 years ago
- Python wrapper for the Mahjong Soul (Majsoul) Protobuf objects. It allows using their API.☆81Updated last year
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆39Updated 3 years ago
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms☆159Updated 2 years ago
- PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF…☆33Updated 4 years ago
- ☆18Updated 4 years ago
- This is an implementation of DeepStack for No Limit Texas Hold'em, extended from DeepStack-Leduc.☆26Updated 6 years ago
- Go package for working with tenhou.net logs and protocol☆13Updated 2 months ago
- Deep reinforcement learning with TensorFlow 2☆93Updated 2 months ago
- ☆14Updated 3 years ago