henrycharlesworth/big2_PPOalgorithm

Application of the proximal policy optimization algorithm to the card game Big 2 using TensorFlow

☆83

Alternatives and similar repositories for big2_PPOalgorithm

Users who are interested in big2_PPOalgorithm compare it to the repositories listed below

menglinjian / Deep-FTRL-ORW
Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…
☆11 · Dec 1, 2022 · Updated 3 years ago
philipjball / TD3_PyTorch
♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation
☆10 · Jun 20, 2021 · Updated 4 years ago
submit-paper / Doudizhu_plus
☆45 · Oct 21, 2022 · Updated 3 years ago
gebob19 / rl_with_jax
Clear single-file JAX implementations of common RL algorithms
☆16 · Sep 5, 2021 · Updated 4 years ago
zuzuba / CISR_NeurIPS20
☆18 · Nov 16, 2020 · Updated 5 years ago
brain-research / LeaveNoTrace
Leave No Trace is an algorithm for safe reinforcement learning.
☆15 · Apr 30, 2018 · Updated 7 years ago
ShibiHe / Poker-Fictitious-Play
Fictitious Self-play & Reinforcement Learning
☆18 · Jan 26, 2018 · Updated 8 years ago
ds-hwang / d2bs
D2BS, short for Diablo 2 Botting System, uses the open source JavaScript engine named 'SpiderMonkey' to execute user scripts inside of Di…
☆16 · Jul 9, 2016 · Updated 9 years ago
tondeur-h / UCIChess
UCI chess protocol API in Java for GUI chess clients
☆12 · Nov 11, 2015 · Updated 10 years ago
zhaoyizhou1123 / mbrcsl
☆12 · Nov 18, 2023 · Updated 2 years ago
wenhuizhang / autoCar
☆20 · Dec 10, 2018 · Updated 7 years ago
ftomassetti / civs-browser
A web application to visualize the history files produced by csv
☆13 · Aug 31, 2014 · Updated 11 years ago
sebastianruder / tensorflow-experiments
Repository for experiments with TensorFlow
☆11 · Nov 25, 2015 · Updated 10 years ago
Liuweiming / ACH_poker
☆25 · Jul 15, 2022 · Updated 3 years ago
Vincentzyx / Douzero_Cloud_Client
Cloud client for DouZero training
☆11 · Dec 26, 2021 · Updated 4 years ago
pzaffino / python-mha
Read and write mha files using Python
☆10 · Oct 14, 2013 · Updated 12 years ago
jsztompka / MultiAgent-PPO
Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis
☆31 · Jan 9, 2019 · Updated 7 years ago
njustesen / a2c_gvgai
A2C for GVG-AI
☆23 · Nov 7, 2018 · Updated 7 years ago
kraktos / MAB
PyData MAB Tutorial
☆10 · Jul 6, 2018 · Updated 7 years ago
mq1n / ForceThreadSuspend
Windows hidden thread suspend POC with code injection
☆12 · May 27, 2017 · Updated 8 years ago
JuliaGeometry / MarchingCubes.jl
Efficient Implementation of Marching Cubes' Cases with Topological Guarantees
☆24 · Mar 9, 2026 · Updated last week
koustuvsinha / clutrr-workshop
☆20 · Sep 7, 2019 · Updated 6 years ago
ylcangel / android_got_hook
Android GOT hook under version 5.0
☆12 · Jun 13, 2019 · Updated 6 years ago
wanttobeno / Detours_4.0.1
Microsoft Detours 4.0.1, MIT License, supports x86, x64, ARM, IA64
☆12 · Apr 23, 2018 · Updated 7 years ago
RisticDjordje / BlockBlast-Game-AI-Agent
BlockBlast reimplementation + RL agents (DQN, PPO, PPO+Action Masking, DQN+Action Masking, Random)
☆25 · Apr 25, 2025 · Updated 10 months ago
radifar / pyplif
Automatically exported from code.google.com/p/pyplif
☆10 · Nov 23, 2018 · Updated 7 years ago
DPotoyan / Statmech4ChemBio
Statistical Mechanics for Chemistry and Biology
☆13 · Mar 11, 2026 · Updated last week
boschresearch / ube-mbrl
Model-Based Uncertainty in Value Functions (AISTATS 2023)
☆16 · Feb 28, 2023 · Updated 3 years ago
lil-lab / ciff
Cornell Instruction Following Framework
☆34 · Oct 11, 2021 · Updated 4 years ago
aspuru-guzik-group / molar
Molar is a database management tool that makes it easy to store experiments, whether computational or not
☆11 · Jul 15, 2022 · Updated 3 years ago
llSourcell / Talking-Face-Generation-DAVS
Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation
☆29 · Nov 19, 2018 · Updated 7 years ago
reymond-group / RingBreaker
Source code and documentation of a specialized computer-assisted synthesis planning (CASP) tool used for the deconstruction of ring syste…
☆12 · May 25, 2020 · Updated 5 years ago
PatWalters / frankenrocs
☆16 · Jul 7, 2024 · Updated last year
robertdavidgraham / nmap
Nmap - the Network Mapper. GitHub mirror of official SVN repository.
☆10 · Sep 5, 2018 · Updated 7 years ago
vidalt / OCEAN
OCEAN: Optimal Counterfactual Explanations in Tree Ensembles (ICML 2021)
☆35 · Feb 16, 2026 · Updated last month
rizar / CLOSURE
Systematic generalization test for CLEVR
☆15 · Mar 11, 2020 · Updated 6 years ago
hari-sikchi / LOOP
Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]
☆41 · Aug 27, 2022 · Updated 3 years ago
MattSkala / bang-game
Bang! card game with LAN multiplayer and AI implemented in C++
☆11 · May 26, 2015 · Updated 10 years ago
ljmartin / mmpbsa_from_openmm
Example demonstrating a free energy estimation starting from OFF and OpenMM
☆12 · Oct 21, 2020 · Updated 5 years ago