hmomin / PPO-Winter-RunView external linksLinks
Trains an agent with Proximal Policy Optimization (PPO) to beat Winter Run
☆23May 21, 2022Updated 3 years ago
Alternatives and similar repositories for PPO-Winter-Run
Users that are interested in PPO-Winter-Run are comparing it to the libraries listed below
Sorting:
- Reinforcement learning training project for a SLG game☆13Dec 21, 2017Updated 8 years ago
- Code accompanying my Medium series on building an AI for Poker☆15May 1, 2020Updated 5 years ago
- CFR implementation of a poker bot.☆12Feb 17, 2023Updated 2 years ago
- Port of the tensorflow c api to csharp☆22Mar 15, 2016Updated 9 years ago
- Bridging caffe2 with yolo, esp on mobile devices☆16May 6, 2017Updated 8 years ago
- Fictitious Self-play & Reinforcement Learning☆18Jan 26, 2018Updated 8 years ago
- Tutorial: Writing R and Python Packages with Multithreaded C++ Code using BLAS, AVX2/AVX512, OpenMP, C++11 Threads and Cuda GPU accelerat…☆13Nov 27, 2022Updated 3 years ago
- Analysing result obtained using quite different RL algorithm☆13Sep 5, 2019Updated 6 years ago
- My published benchmark for a Kaggle Simulations Competition☆28Dec 8, 2021Updated 4 years ago
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- Deep Reinforcement Learning Framework done with PyTorch☆43Mar 12, 2025Updated 11 months ago
- A Go based WebSerial shim/proxy☆10Dec 28, 2016Updated 9 years ago
- A Texas Holdem poker framework written in C++ 20.☆11Apr 23, 2023Updated 2 years ago
- 🌿快速生成文件夹目录结构,支持定义目录层级,支持生成到 markdown 文件。☆13Oct 19, 2022Updated 3 years ago
- 🚀全流程自己训练一个VLA 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!☆26Oct 16, 2025Updated 4 months ago
- A QA system based on k8s-specific knowledge build on ChatGLM2-6B, serving by Ray.☆10Sep 14, 2023Updated 2 years ago
- 3600 frames of point cloud skeleton data☆15Nov 15, 2024Updated last year
- A C++ pytorch implementation of MuZero☆40May 1, 2024Updated last year
- A Rust implementation of the Monte Carlo Tree Search (MCTS) algorithm, utilizing an arena allocator for efficient memory management.☆10Jan 26, 2025Updated last year
- Poker hand evaluation for Go☆12Feb 7, 2014Updated 12 years ago
- Javascript library based on Raphael.js for editing shapes☆10Feb 15, 2024Updated 2 years ago
- Run TensorFlow on ESP32 chips without pain☆11Dec 27, 2023Updated 2 years ago
- PyTorch implementation of DreamerV3, Mastering Diverse Domains through World Models.☆10Feb 16, 2024Updated 2 years ago
- Advanced_Data_Integration_Project☆11Jul 31, 2018Updated 7 years ago
- Wolfram LibraryLink interface for Rust [Deprecated]☆10Mar 8, 2024Updated last year
- Swarm learning algorithm☆11Jun 2, 2021Updated 4 years ago
- Cassandra (CQL) driver for Rust, using the DataStax C/C++ driver under the covers.☆13Jun 17, 2022Updated 3 years ago
- An implementation of the AlphaZero algorithm for adversarial games to be used with the machine learning framework of your choice☆12Aug 30, 2020Updated 5 years ago
- Developing, training, and assessing the performance of a Proximal Policy Optimization (PPO) Stock Trading Agent.☆13Aug 20, 2025Updated 5 months ago
- Pusher Beams Java Server SDK☆10Feb 12, 2019Updated 7 years ago
- This code monitors (or sniff) the radiosignals sent by Uponor KNX RF thermostats and sent to OpenHAB using the REST interface. A CC1101 c…☆11Dec 2, 2022Updated 3 years ago
- The most simplest and super efficient command line tools, distributed in solely one file.☆10Nov 2, 2022Updated 3 years ago
- SQL Adventure Builder: transform a dataset and a collection of SQL exercises into a self-contained database☆10Aug 14, 2025Updated 6 months ago
- CFR-based Texas Hold'em AI☆11Jan 30, 2021Updated 5 years ago
- nd009-cn-advanced-p5,针对Udacity CN MLND P5项目☆14Jun 27, 2022Updated 3 years ago
- Gym wrapper for pysc2☆10Sep 16, 2022Updated 3 years ago
- Emulator of the soviet ternary computer "Setun-70" (Сетунь-70)☆18Dec 9, 2024Updated last year
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 2 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago