Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment
☆20Dec 2, 2025Updated 3 months ago
Alternatives and similar repositories for ppo-self-play
Users that are interested in ppo-self-play are comparing it to the libraries listed below
Sorting:
- CFR-based Texas Hold'em AI☆11Jan 30, 2021Updated 5 years ago
- Advanced_Data_Integration_Project☆11Jul 31, 2018Updated 7 years ago
- Project under CSF407 - AI☆13Jun 24, 2024Updated last year
- My Submission for the OpenAI/NeurIPS ProcGen Competition☆11Nov 12, 2020Updated 5 years ago
- A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.☆15Dec 8, 2020Updated 5 years ago
- ☆16Oct 6, 2019Updated 6 years ago
- ☆18Jan 4, 2021Updated 5 years ago
- CFR implementation of a poker bot.☆12Feb 17, 2023Updated 3 years ago
- ☆45Nov 29, 2021Updated 4 years ago
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆56Jan 20, 2023Updated 3 years ago
- Stanford CS234: Reinforcement Learning Winter 2020☆19Mar 24, 2023Updated 2 years ago
- A python package to design and debug RL agents.☆33Jan 15, 2026Updated last month
- Simple implementation of regret matching algorithm for RPS nash equilibrium computation via self-play☆26Sep 25, 2018Updated 7 years ago
- A collection of different PyTorch wrappers for training neural networks and reinforcement algorithms☆13Dec 15, 2022Updated 3 years ago
- Tutorial: Writing R and Python Packages with Multithreaded C++ Code using BLAS, AVX2/AVX512, OpenMP, C++11 Threads and Cuda GPU accelerat…☆13Nov 27, 2022Updated 3 years ago
- (AAAI'2019) The codes, models, logs, and data for an extended paper of the original paper "On Reinforcement Learning for Full-length Game…☆31Oct 5, 2022Updated 3 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆76Jun 9, 2023Updated 2 years ago
- ☆33Apr 29, 2023Updated 2 years ago
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards☆32Jan 19, 2023Updated 3 years ago
- The official code releasement of publications in MARL field of TJU RL lab.☆87Jul 15, 2022Updated 3 years ago
- Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆83Dec 17, 2024Updated last year
- ☆35Sep 5, 2020Updated 5 years ago
- Example application for V4L2☆14Sep 24, 2025Updated 5 months ago
- A Texas Holdem poker framework written in C++ 20.☆11Apr 23, 2023Updated 2 years ago
- A QA system based on k8s-specific knowledge build on ChatGLM2-6B, serving by Ray.☆10Sep 14, 2023Updated 2 years ago
- Tabula Rasa Tic-Tac-Toe☆10Jan 3, 2019Updated 7 years ago
- 🌿快速生成文件夹目录结构,支持定义目录层级,支持生成到 markdown 文件。☆13Oct 19, 2022Updated 3 years ago
- 🚀全流程自己训练一个VLA 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!☆27Oct 16, 2025Updated 4 months ago
- ☆11Jun 20, 2022Updated 3 years ago
- Solves the Riccati differential equation for the finite-horizon linear quadratic regulator.☆13Dec 8, 2022Updated 3 years ago
- Some Orbital Mechanics Matlab Codes. Heavily based on the "Orbital Mechanics for Engineers, Howard D. Curtis" book.☆10Apr 17, 2023Updated 2 years ago
- Implementation of Consensus Based Bundle Algorithm (CBBA) with python☆44Nov 25, 2022Updated 3 years ago
- PyTorch implementation of the paper: Multi-Agent Collaborative Inference via DNN Decoupling: Intermediate Feature Compression and Edge Le…☆46Oct 26, 2023Updated 2 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- ☆41Jan 3, 2025Updated last year
- ROS wrapper for SMAC, a versatile tool for optimizing algorithm parameters☆11Jul 19, 2021Updated 4 years ago
- nd009-cn-advanced-p5,针对Udacity CN MLND P5项目☆14Jun 27, 2022Updated 3 years ago
- MLflow App Using React, Hooks, RabbitMQ, FastAPI Server, Celery, Microservices☆11Sep 25, 2022Updated 3 years ago