Multi-agent adaptation of the Soft Actor-Critic reinforcement learning algorithm
☆22 · Dec 24, 2018 · Updated 7 years ago
Alternatives and similar repositories for masac
Users who are interested in masac are comparing it to the libraries listed below.
- Jax and Torch Multi-Agent SAC on PettingZoo API ☆100 · Nov 23, 2024 · Updated last year
- PyTorch implementation of MATD3 ☆13 · Apr 3, 2020 · Updated 6 years ago
- A PyTorch Implementation of Multi Agent Soft Actor Critic ☆43 · Jan 29, 2019 · Updated 7 years ago
- Implementation for mSAC methods in PyTorch ☆41 · Oct 10, 2021 · Updated 4 years ago
- Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x ☆173 · Oct 24, 2023 · Updated 2 years ago
- Implementation of centralized training, centralized execution of Soft Actor-Critic (SAC) on a Tennis multiagent Unity environment. ☆42 · Mar 31, 2021 · Updated 5 years ago
- RLToolkit is a flexible and highly efficient reinforcement learning framework. Includes implementations of DQN, AC, ACER, A2C, A3C, PG, DDPG,… ☆20 · Dec 14, 2023 · Updated 2 years ago
- Mobile Edge Computing Hierarchical Model which has the mobile as the edge server and the cloud as the central server. The load balancing … ☆15 · May 5, 2018 · Updated 7 years ago
- curriculum ☆27 · Feb 7, 2023 · Updated 3 years ago
- Multi-task Multi-agent Soft Actor Critic for SMAC ☆15 · Jan 18, 2022 · Updated 4 years ago
- Source code for the "Collision Risk Assessment and Forecasting on Maritime Data" paper ☆13 · Oct 8, 2023 · Updated 2 years ago
- Game Theory Course Project ☆10 · Dec 7, 2018 · Updated 7 years ago
- Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019 ☆800 · May 29, 2022 · Updated 3 years ago
- ☆12 · Jan 25, 2023 · Updated 3 years ago
- Network Calculus for end-to-end delay bounds of an AFDX network. ☆16 · Sep 3, 2020 · Updated 5 years ago
- Deep Reinforcement Learning based Autonomous Driving Agents ☆10 · Jul 7, 2022 · Updated 3 years ago
- Active Vision Tracking of a Tendon-driven continuum robot by using efficient model-based Reinforcement Learning ☆12 · Aug 25, 2023 · Updated 2 years ago
- Code to reproduce experiments from: ☆10 · Dec 11, 2020 · Updated 5 years ago
- ☆10 · Jun 13, 2025 · Updated 10 months ago
- Reinforcement Learning Practice for Multi and Single-Agent Autonomous vehicle ☆13 · Dec 11, 2020 · Updated 5 years ago
- Design and build a biped robot that can run, purely for stable running, leaving out all flashy show moves. If it must have a name, call it 狂奔 (wild run)! ☆13 · Jun 18, 2019 · Updated 6 years ago
- Improving scalability of RL algorithms using GNNs: A case study in optimal EV charging. ☆29 · Oct 16, 2025 · Updated 6 months ago
- This is a ROS repository to track an underwater target using a Particle Filter range-only method and the SparusII AUV ☆11 · Nov 27, 2024 · Updated last year
- Biped hardware control code ☆24 · Oct 6, 2024 · Updated last year
- A simple and easy to use implementation of the soft actor-critic algorithm. ☆15 · Sep 2, 2022 · Updated 3 years ago
- Multi-resource Dynamic Coordinated Planning of Flexible Distribution Network ☆15 · Jun 11, 2024 · Updated last year
- Code of "Towards Skilled Population Curriculum for MARL" + Implementation of Curriculum MARL algorithms based on Ray ☆13 · Feb 20, 2023 · Updated 3 years ago
- Machine learning accelerated Branch and Bound for Joint beamforming and antenna selection ☆26 · Jul 20, 2023 · Updated 2 years ago
- ☆19 · Feb 26, 2025 · Updated last year
- This set of codes implements our TSG paper "Hierarchical Deep Learning Model for Degradation Prediction per Look-Ahead Scheduled Battery … ☆11 · Feb 24, 2025 · Updated last year
- A swarm UAVs project ☆11 · Sep 8, 2019 · Updated 6 years ago
- ☆22 · May 20, 2021 · Updated 4 years ago
- Master's thesis code: deep reinforcement learning ☆10 · Apr 4, 2020 · Updated 6 years ago
- ☆10 · Sep 23, 2019 · Updated 6 years ago
- Object-oriented (OOP) implementation of the flexABLE market simulation model ☆11 · Nov 25, 2025 · Updated 4 months ago
- Related paper: Online Scheduling for Energy Minimization in Wireless Powered Mobile Edge Computing ☆10 · Jan 5, 2023 · Updated 3 years ago
- DistFlow Safe Reinforcement Learning Algorithm for Voltage Magnitude Regulation in Distribution Networks ☆13 · Jul 9, 2025 · Updated 9 months ago
- This is a MADDPG algorithm to be used on particle environment styles. I use it to test my own scenarios for underwater target localizatio… ☆18 · Jun 23, 2021 · Updated 4 years ago
- Multi-Agent Reinforcement Learning (MARL) method to learn scalable control policies for multi-agent target tracking (IROS22). ☆11 · Jul 22, 2022 · Updated 3 years ago