ImmanuelXIV/ppo-self-play

Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment

☆20

Alternatives and similar repositories for ppo-self-play

Users who are interested in ppo-self-play are comparing it to the repositories listed below.

AnanyaJain3 / Spacecraft-Trajectory-Optimization
Project under CSF407 - AI
☆13 | Jun 24, 2024 | Updated 2 years ago
DRL-CASIA / Deep-Reinforcement-Learning
☆18 | Jan 4, 2021 | Updated 5 years ago
hanizaidi110 / Opponent-Modeling-and-Predicting-Opponent-moves-in-Poker
Advanced_Data_Integration_Project
☆11 | Jul 31, 2018 | Updated 7 years ago
jhhom / fjsp-gnnrl
Solving Flexible Job Shop Scheduling by learning to dispatch with Deep Reinforcement Learning
☆13 | Jul 21, 2022 | Updated 4 years ago
khalil-research / Multi-Task_Predict-then-Optimize
Multi-task end-to-end predict-then-optimize
☆14 | Apr 28, 2023 | Updated 3 years ago
weixians / GnnJssp
Solving the JSSP (job shop scheduling problem) with graph neural networks
☆12 | Nov 29, 2023 | Updated 2 years ago
hijkzzz / noisy-mappo
Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)
☆76 | Jun 9, 2023 | Updated 3 years ago
dspub99 / betazero
Tabula Rasa Tic-Tac-Toe
☆10 | Jan 3, 2019 | Updated 7 years ago
mjanschek / pytorch_seed_rl
A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.
☆14 | Dec 8, 2020 | Updated 5 years ago
NickGeramanis / rl-uav
Undergraduate Thesis.
☆11 | Apr 13, 2025 | Updated last year
int8 / regret-matching
Simple implementation of the regret matching algorithm for RPS Nash equilibrium computation via self-play
☆26 | Sep 25, 2018 | Updated 7 years ago
dranaju / curl_navigation
☆11 | Sep 15, 2023 | Updated 2 years ago
emanuelepesce / unmanned-aerial-vehicles-marl-env
Multi Agent Reinforcement Learning Environment For Aerial Unmanned Vehicles
☆13 | Apr 13, 2023 | Updated 3 years ago
tmoer / a0c
Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)
☆15 | Jan 19, 2021 | Updated 5 years ago
marcbrittain / Prioritized-Sequence-Experience-Replay
Prioritized Sequence Experience Replay
☆10 | Aug 16, 2021 | Updated 4 years ago
morning9393 / HAPPO-HATRPO
☆48 | Nov 29, 2021 | Updated 4 years ago
boredengineering / OmniIsaacGymEnvs
Reinforcement Learning Environments for Omniverse Isaac Gym
☆10 | May 9, 2023 | Updated 3 years ago
micah35s / Autoencoder-Image-Compression
PyTorch implementation for image compression and reconstruction via autoencoder
☆10 | Jun 17, 2020 | Updated 6 years ago
maximilianigl / rl-iter
Repository for Iterated Relearning: The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning
☆11 | Jun 8, 2020 | Updated 6 years ago
Snow-Dancing / ReinforcementLearning
Solving the mountain car and inverted pendulum control problems with the reinforcement learning methods Q-value iteration, Q-learning, and SARSA
☆15 | Jul 25, 2019 | Updated 7 years ago
instadeepai / awesome-marl
A categorised list of Multi-Agent Reinforcement Learning (MARL) papers
☆58 | Jan 20, 2023 | Updated 3 years ago
TJU-DRL-LAB / Multiagent-RL
The official code release for publications in the MARL field from the TJU RL lab.
☆89 | Jul 15, 2022 | Updated 4 years ago
since-89 / FJSP-GNN-DRL
This repo solves a Flexible Job Shop Scheduling Problem using a Deep RL technique
☆23 | Jul 28, 2024 | Updated last year
Gj-12222 / ajing_marl
This is a personal library that strives to implement various MARL algorithms. The environment only integrates MPE, and the algorithm curr…
☆15 | May 22, 2025 | Updated last year
ignc-research / arena-marl
Multi Agent Reinforcement Learning for ROS in 2D Simulation Environments
☆16 | Nov 15, 2021 | Updated 4 years ago
GT-STAR-Lab / MARBLER
Multi-Robot RL Benchmark and Learning Environment for the Robotarium | IEEE MRS 2023 (Best Paper Award)
☆14 | Mar 31, 2025 | Updated last year
yjpark1 / competitiveMARL
Multi-agent reinforcement learning for competitive environments using PyTorch
☆14 | Dec 31, 2019 | Updated 6 years ago
heinrichjh / nfsp-leduc
Neural Fictitious Self-Play in Leduc Holdem
☆11 | Jul 4, 2018 | Updated 8 years ago
gcsarker / vlm_nav
Vision-language-model-guided, monocular-vision-based UAV navigation
☆16 | May 14, 2026 | Updated 2 months ago
RealZST / DRL-based_UAV_Motion_Planning
Code for "A Hybrid Human-in-the-Loop Deep Reinforcement Learning Method for UAV Motion Planning"
☆14 | Jan 15, 2024 | Updated 2 years ago
TonghanWang / RODE
Code accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks" (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …
☆88 | Dec 17, 2024 | Updated last year
bryanoliveira / sliding-puzzles-gym
A scalable benchmark for state representation learning in visual reinforcement learning.
☆17 | Jun 23, 2025 | Updated last year
albert-jin / boids-pe
Boids-PE: A Deep Reinforcement Learning Approach for UAV Pursuit-Evasion: Integrating Boids Model and Apollonian Circles
☆25 | Jun 29, 2024 | Updated 2 years ago
lili-chen / SEER
Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.
☆21 | Mar 5, 2021 | Updated 5 years ago
mbiselx / LeggedRobots
Miniprojects for the MICRO-507: Legged Robots course
☆12 | Jul 1, 2022 | Updated 4 years ago
ShangtongZhang / ShangtongZhang.github.io
My Homepage
☆10 | Jun 26, 2026 | Updated last month
ogroth / shapestacks
TensorFlow models and simulation code for 'ShapeStacks: Learning Vision-Based Physical Intuition for Generalised Object Stacking'
☆46 | Mar 24, 2023 | Updated 3 years ago
siyuandong16 / Tactile_insertion_with_RL
☆11 | Mar 31, 2020 | Updated 6 years ago
RajGhugare19 / VE-principle-for-model-based-RL
Repository for ML Reproducibility Challenge 2020 for the NeurIPS paper, "The Value Equivalence Principle for Model-Based Reinforcement Le…
☆18 | Apr 13, 2021 | Updated 5 years ago