FrostHan / vlog
Variational oracle guiding for reinforcement learning
☆12Updated 3 years ago
Alternatives and similar repositories for vlog
Users that are interested in vlog are comparing it to the libraries listed below
- ☆23Updated 3 years ago
- Japanese Riichi Mahjong written in C++.☆117Updated 7 months ago
- Utility tools for tenhou.net log☆30Updated last year
- AI for three-player mahjong☆17Updated last year
- ☆11Updated 3 years ago
- ☆13Updated 3 years ago
- Deep reinforcement learning of mahjong self-play☆17Updated 7 years ago
- Scripts for downloading logs from tenhou.net☆57Updated 9 years ago
- Riichi Mahjong Kit: (1) Game log crawler (sqlite3, json, bs4); (2) Game log preprocessor; (3) Deterministic algorithms library☆53Updated 6 years ago
- Game server for Japanese riichi mahjong☆17Updated 2 years ago
- Code for "AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms", AAAI 2022 (Oral)☆22Updated last year
- Deep Reinforcement Learning by using Phasic Policy Gradient in Pytorch & Tensorflow☆20Updated 3 years ago
- Bot for tenhou.net riichi mahjong server written in Python☆203Updated 2 years ago
- Scripts to download phoenix logs from tenhou.net☆41Updated last year
- A tool for calculating the shanten number in Japanese mahjong.☆42Updated last month
- PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF…☆33Updated 4 years ago
- FQF(Fully parameterized Quantile Function for distributional reinforcement learning) is a general reinforcement learning framework for At…☆44Updated 4 years ago
- Mahjong4RL is a project that recreates the game of Japanese Mahjong and uses deep reinforcement learning to play it.☆12Updated 3 years ago
- This is the code for Q-value Path Decomposition for Deep Multiagent Reinforcement Learning (NeurIPS 2019).☆11Updated 6 years ago
- ☆45Updated 2 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆39Updated 4 years ago
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms☆163Updated 2 years ago
- ☆14Updated 3 years ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆28Updated 3 years ago
- ☆13Updated 3 years ago
- ♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation☆10Updated 4 years ago
- ☆18Updated 4 years ago
- ☆17Updated 2 years ago
- Gated Transformer Model for Computer Vision☆23Updated 4 years ago
- ☆16Updated 2 years ago