Application of the proximal policy optimization (PPO) algorithm to the card game Big 2 using TensorFlow
☆83Oct 3, 2023Updated 2 years ago
Alternatives and similar repositories for big2_PPOalgorithm
Users that are interested in big2_PPOalgorithm are comparing it to the libraries listed below
- Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…☆11Dec 1, 2022Updated 3 years ago
- ♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation☆10Jun 20, 2021Updated 4 years ago
- ☆45Oct 21, 2022Updated 3 years ago
- clear single-file JAX implementations of common RL algorithms☆16Sep 5, 2021Updated 4 years ago
- ☆18Nov 16, 2020Updated 5 years ago
- Leave No Trace is an algorithm for safe reinforcement learning.☆15Apr 30, 2018Updated 7 years ago
- Fictitious Self-play & Reinforcement Learning☆18Jan 26, 2018Updated 8 years ago
- D2BS, short for Diablo 2 Botting System, uses the open source Javascript engine named 'SpiderMonkey' to execute user scripts inside of Di…☆16Jul 9, 2016Updated 9 years ago
- UCI chess protocol API in Java for GUI chess clients☆12Nov 11, 2015Updated 10 years ago
- ☆12Nov 18, 2023Updated 2 years ago
- ☆20Dec 10, 2018Updated 7 years ago
- A web application to visualize the history files produced by csv☆13Aug 31, 2014Updated 11 years ago
- Repository for experiments with TensorFlow☆11Nov 25, 2015Updated 10 years ago
- ☆25Jul 15, 2022Updated 3 years ago
- Cloud client for douzero training☆11Dec 26, 2021Updated 4 years ago
- Read and write mha files using Python☆10Oct 14, 2013Updated 12 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆31Jan 9, 2019Updated 7 years ago
- A2C for GVG-AI☆23Nov 7, 2018Updated 7 years ago
- Pydata MAB Tutorial☆10Jul 6, 2018Updated 7 years ago
- Windows hidden thread suspend POC with code injection☆12May 27, 2017Updated 8 years ago
- Efficient Implementation of Marching Cubes' Cases with Topological Guarantees☆24Mar 9, 2026Updated last week
- ☆20Sep 7, 2019Updated 6 years ago
- Android GOT hook for versions below 5.0☆12Jun 13, 2019Updated 6 years ago
- Microsoft Detours 4.0.1, MIT License, supports x86, x64, ARM, IA64☆12Apr 23, 2018Updated 7 years ago
- BlockBlast reimplementation + RL agents (DQN, PPO, PPO+Action Masking, DQN+Action Masking, Random)☆25Apr 25, 2025Updated 10 months ago
- Automatically exported from code.google.com/p/pyplif☆10Nov 23, 2018Updated 7 years ago
- Statistical Mechanics for Chemistry and Biology☆13Mar 11, 2026Updated last week
- Model-Based Uncertainty in Value Functions (AISTATS2023)☆16Feb 28, 2023Updated 3 years ago
- Cornell Instruction Following Framework☆34Oct 11, 2021Updated 4 years ago
- Molar is a database management tool that makes it easy to store experiments, whether computational or not☆11Jul 15, 2022Updated 3 years ago
- Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation☆29Nov 19, 2018Updated 7 years ago
- Source code and documentation of a specialized computer assisted synthesis planning (CASP) tool used for the deconstruction of ring syste…☆12May 25, 2020Updated 5 years ago
- ☆16Jul 7, 2024Updated last year
- Nmap - the Network Mapper. Github mirror of official SVN repository.☆10Sep 5, 2018Updated 7 years ago
- OCEAN: Optimal Counterfactual Explanations in Tree Ensembles (ICML 2021)☆35Feb 16, 2026Updated last month
- Systematic generalization test for CLEVR☆15Mar 11, 2020Updated 6 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆41Aug 27, 2022Updated 3 years ago
- Bang! card game with LAN multiplayer and AI implemented in C++☆11May 26, 2015Updated 10 years ago
- example demonstrating a free energy estimation starting from OFF and OpenMM☆12Oct 21, 2020Updated 5 years ago