Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment
☆20Dec 2, 2025Updated 5 months ago
Alternatives and similar repositories for ppo-self-play
Users that are interested in ppo-self-play are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Oct 6, 2019Updated 6 years ago
- Project under CSF407 - AI☆13Jun 24, 2024Updated last year
- ☆18Jan 4, 2021Updated 5 years ago
- Advanced_Data_Integration_Project☆11Jul 31, 2018Updated 7 years ago
- A scalable benchmark for state representation learning in visual reinforcement learning.☆17Jun 23, 2025Updated 11 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆76Jun 9, 2023Updated 2 years ago
- Tabula Rasa Tic-Tac-Toe☆10Jan 3, 2019Updated 7 years ago
- A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.☆14Dec 8, 2020Updated 5 years ago
- Explore and Control with Adversarial Surprise☆10Jul 20, 2021Updated 4 years ago
- ☆11Sep 15, 2023Updated 2 years ago
- Undergraduate Thesis.☆11Apr 13, 2025Updated last year
- UAV PATH TRACKING AND DYNAMIC AVOIDANCE BASED ON ADS-B AND DEEP REINFORCEMENT LEARNING for Univerisity of Bristol RP3 final☆12Apr 18, 2023Updated 3 years ago
- Simple implementation of regret matching algorithm for RPS nash equilibrium computation via self-play☆26Sep 25, 2018Updated 7 years ago
- Multi Agent Reinforcement Learning Environment For Aerial Unmanned Vehicles☆13Apr 13, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)☆15Jan 19, 2021Updated 5 years ago
- ☆48Nov 29, 2021Updated 4 years ago
- Reinforcement Learning Environments for Omniverse Isaac Gym☆10May 9, 2023Updated 3 years ago
- Repository for Iterated Relearning: The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning☆11Jun 8, 2020Updated 5 years ago
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆58Jan 20, 2023Updated 3 years ago
- This is a personal library that strives to implement various MARL algorithms. The environment only integrates MPE, and the algorithm curr…☆15May 22, 2025Updated last year
- The official code releasement of publications in MARL field of TJU RL lab.☆90Jul 15, 2022Updated 3 years ago
- Multi Agent Reinforcement Learning for ROS in 2D Simulation Environments☆16Nov 15, 2021Updated 4 years ago
- Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆86Dec 17, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- code for `A Hybrid Human-in-the-Loop Deep Reinforcement Learning Method for UAV Motion Planning'☆14Jan 15, 2024Updated 2 years ago
- Boids-PE: A Deep Reinforcement Learning Approach for UAV Pursuit-Evasion: Integrating Boids Model and Apollonian Circles☆24Jun 29, 2024Updated last year
- Repository for (for now) filing bug reports about PLAI.☆15Jul 5, 2025Updated 10 months ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Mar 5, 2021Updated 5 years ago
- Policy learning of in-hand manipulation. Proximal policy optimization trains the Allegro hand to learn a stabilizing grasp☆14Feb 5, 2024Updated 2 years ago
- 利用强化学习的Q价值迭代,Q学习以及SARSA方法解决小车爬山以及倒立摆的控制问题☆14Jul 25, 2019Updated 6 years ago
- Miniprojects for the MICRO-507 : Legged Robots course☆12Jul 1, 2022Updated 3 years ago
- My Homepage☆10May 16, 2026Updated last week
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards☆31Jan 19, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Tensorflow models and simulation code for 'ShapeStacks: Learning Vision-Based Physical Intuition for Generalised Object Stacking'☆46Mar 24, 2023Updated 3 years ago
- Multi-Robot RL Benchmark and Learning Environment for the Robotarium | IEEE MRS 2023 (Best Paper Award)☆14Mar 31, 2025Updated last year
- This branch contain the java classes for orekit-python-wrapper☆19May 13, 2026Updated 2 weeks ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- This project is the partially open source code of EI thesis Multi-Dimensional Decision-Making for UAV Air Combat Based on Hierarchical Re…☆27Apr 17, 2024Updated 2 years ago
- Website for Alloytools☆13Nov 3, 2025Updated 6 months ago
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆24Feb 27, 2022Updated 4 years ago