A MARL PPO implementation with tf-agents, configured for the MultiCarRacing-v0 Gym environment.
☆19Jun 24, 2021Updated 4 years ago
Alternatives and similar repositories for marl_ppo
Users that are interested in marl_ppo are comparing it to the libraries listed below.
- Multi-Agent Deep Recurrent Q-Learning with Bayesian epsilon-greedy on AirSim simulator☆13Apr 1, 2022Updated 4 years ago
- ☆11Nov 2, 2021Updated 4 years ago
- This repository contains the Python implementation of our submitted paper titled "Deep Reinforcement Learning for Joint Trajectory and Co…☆15Jun 29, 2024Updated last year
- A curated list of research and projects on world models☆81Updated this week
- An implementation for CVRP problem with A3C+Attention mechanism and GCN☆18May 17, 2020Updated 6 years ago
- Low level control interface.☆15Jun 5, 2025Updated 11 months ago
- Resilient Multi-Agent Reinforcement Learning☆10Nov 4, 2022Updated 3 years ago
- An OpenAI Gym environment for multi-agent car racing based on Gym's original car racing environment.☆91Feb 20, 2026Updated 3 months ago
- SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores☆15Apr 24, 2024Updated 2 years ago
- Reinforcement Learning Algorithms Trained with Unity☆13Apr 26, 2019Updated 7 years ago
- My implementation of common algorithms☆13Oct 6, 2019Updated 6 years ago
- ☆17Nov 16, 2020Updated 5 years ago
- UBC Snowbots Codebase☆10Apr 1, 2023Updated 3 years ago
- Classify the jamming pattern and predict the action of channel selection in the future time slots☆22Aug 28, 2021Updated 4 years ago
- Zhejiang University Beamer template☆16May 19, 2022Updated 4 years ago
- Small Google Scholar self-lookup script that displays the current citation count each time the shell is opened.☆21Mar 3, 2025Updated last year
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆43May 25, 2022Updated 3 years ago
- ☆14Oct 26, 2022Updated 3 years ago
- an implementation of ATOC☆14Dec 6, 2021Updated 4 years ago
- [ICMR 2025] Official Repository for The Paper, Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale …☆18Aug 17, 2025Updated 9 months ago
- Python demo for the paper "Pareto Monte Carlo Tree Search for Multi-Objective Informative Planning".☆34Nov 9, 2022Updated 3 years ago
- ☆16Jun 30, 2019Updated 6 years ago
- ☆11Apr 29, 2018Updated 8 years ago
- A robotics library for Python☆20Nov 29, 2020Updated 5 years ago
- A comparison of some conformal quantile regression methods.☆12Sep 14, 2019Updated 6 years ago
- Raptor, the random arpeggiator (real-time algorithmic composition program implemented as a Pd patch)☆12Jan 12, 2018Updated 8 years ago
- [IJCAI 2021] Robust Adversarial Imitation Learning via Adaptively-Selected Demonstrations☆16Feb 17, 2023Updated 3 years ago
- Reproducible research code for the experiments presented in our article "Kara1k: a karaoke dataset for cover song identification and sing…☆10Jan 9, 2018Updated 8 years ago
- Implementation of PILCO for the Model-Based Baselines Project☆18Jul 18, 2019Updated 6 years ago
- A web app for annotating Freesound loops, and the tools to analyse the dataset created.☆20Jul 6, 2023Updated 2 years ago
- Apple watch application to collect accelerometer and gyroscope data after detecting a golf swing.☆12Jan 16, 2019Updated 7 years ago
- ☆12Jun 9, 2025Updated 11 months ago
- ☆12Oct 20, 2023Updated 2 years ago
- PPO☆16Apr 3, 2025Updated last year
- NeurIPS paper 'Censored Quantile Regression Neural Networks for Distribution-Free Survival Analysis'☆11Oct 28, 2022Updated 3 years ago
- TRPO Implementation in Tensorflow 2.0 for Reinforcement Learning Project @ Sapienza☆16Mar 25, 2023Updated 3 years ago
- This is the official repository for the paper "Guided Exploration with Proximal Policy Optimization using a Single Demonstration", https:…☆19Oct 5, 2021Updated 4 years ago
- PyTorch Implementation of COPA for coordinating teams that can dynamically change.☆23Apr 16, 2022Updated 4 years ago
- RDS is a reactive controller for convex non-holonomic robots to avoid collisions with moving obstacles.☆24Jul 8, 2022Updated 3 years ago