FrostHan / vlog
Variational oracle guiding for reinforcement learning
☆12Updated 3 years ago
Alternatives and similar repositories for vlog
Users that are interested in vlog are comparing it to the libraries listed below
- ☆23Updated 3 years ago
- Japanese Riichi Mahjong written in C++.☆118Updated 9 months ago
- AI for three-player mahjong☆17Updated last year
- Game server for Japanese Riichi Mahjong☆17Updated 2 years ago
- ☆13Updated 4 years ago
- Utility tools for tenhou.net log☆30Updated last year
- Deep reinforcement learning of mahjong self-play☆17Updated 7 years ago
- ☆11Updated 3 years ago
- Scripts for downloading logs from tenhou.net☆57Updated 9 years ago
- ☆45Updated 2 years ago
- Code for "AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms", AAAI 2022 (Oral)☆22Updated last year
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆39Updated 4 years ago
- An AI for 3-player Mahjong (Sanma) using deep reinforcement learning☆39Updated last year
- Deep Reinforcement Learning by using Phasic Policy Gradient in Pytorch & Tensorflow☆20Updated 4 years ago
- ☆13Updated 3 years ago
- ☆14Updated 3 years ago
- ♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation☆10Updated 4 years ago
- FQF (Fully parameterized Quantile Function for distributional reinforcement learning) is a general reinforcement learning framework for At…☆46Updated 5 years ago
- Mahjong4RL is a project that recreates the game of Japanese Mahjong and uses deep reinforcement learning to play it.☆12Updated 3 years ago
- This is the code for Q-value Path Decomposition for Deep Multiagent Reinforcement Learning (NeurIPS 2019).☆12Updated 6 years ago
- PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF…☆33Updated 5 years ago
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms☆164Updated 2 years ago
- Riichi Mahjong Kit: (1) Game log crawler (sqlite3, json, bs4); (2) Game log preprocessor; (3) Deterministic algorithms library☆53Updated 6 years ago
- Reinforcement learning (RL) implementation of the imperfect-information game Mahjong using Markov decision processes to predict future game s…☆94Updated 3 years ago
- Scripts to download phoenix logs from tenhou.net☆41Updated 2 weeks ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆54Updated last year
- A tool for calculating the shanten number in Japanese mahjong.☆42Updated this week
- ☆17Updated 2 years ago
- Benchmarked implementations of Offline Multi-Agent RL Algorithms based on PyMARL codebase.☆30Updated last year
- ☆17Updated 2 years ago