Multi-Agent Reinforcement Learning with Stable-Baselines3
☆20Dec 3, 2021Updated 4 years ago
Alternatives and similar repositories for marl-baselines3
Users that are interested in marl-baselines3 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Using RLLib and PycoLab to explore intelligent cooperative behavior in sequential social dilemmas☆54Dec 8, 2022Updated 3 years ago
- Simple code for running and visualizing replicator dynamics☆11Jan 31, 2024Updated 2 years ago
- Repository with environment and training scripts for paper "Cross-Environment-Cooperation Enables Zero-shot Multi-agent Cooperation"☆21Sep 12, 2025Updated 8 months ago
- Software for performing value iteration on partially observable Markov decision processes (POMDPs).☆17Feb 2, 2024Updated 2 years ago
- ☆14Jun 21, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Simulate an epidemic metapopulation model with mobility-reducing containment strategies☆11Aug 27, 2020Updated 5 years ago
- A human mobility flow-augmented stochastic SEIR-style epidemic modeling framework is developed, which combines with data assimilation and…☆15Mar 10, 2023Updated 3 years ago
- ☆10Jun 26, 2024Updated last year
- A simple wrapper to analyse and visualise reinforcement learning agents' behaviour in the environment.☆14Jan 8, 2022Updated 4 years ago
- Official codebase for Human Guided Exploration (HuGE)☆22Aug 16, 2023Updated 2 years ago
- Repo for reproduction of sequential social dilemmas☆416Mar 6, 2025Updated last year
- Unity Chat system including audio chat, video chat and text chat through photon, socket and firebase, however where user can use this plu…☆12Jan 12, 2021Updated 5 years ago
- A toolbox with the goal of speeding up research on bargaining in MARL (cooperation problems in MARL).☆32Sep 29, 2022Updated 3 years ago
- ☆13Feb 5, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A Survey Analyzing Generalization in Deep Reinforcement Learning☆37Oct 31, 2024Updated last year
- 图神经网络课程——图注意力网络☆11Dec 28, 2019Updated 6 years ago
- CantoInput - an input method (IME) for Cantonese☆17Jul 7, 2011Updated 14 years ago
- HIPPO 🦛 is an explainable AI method and toolkit for weakly-supervised models in computational pathology. It enables hypothesis testing o…☆18Jan 15, 2025Updated last year
- Re-implementation of reinforcement learning based quadcopter control in gym-pybullet-drones.☆27Mar 12, 2024Updated 2 years ago
- Comparison of bandit algorithms from the Reinforcement Learning bible.☆17Jun 6, 2018Updated 7 years ago
- A Really Scalable RL Framework to 10k+ CPUs☆38Feb 29, 2024Updated 2 years ago
- Transcripts for various Youtube Channels inspired by https://karpathy.ai/lexicap/index.html☆17Nov 14, 2025Updated 6 months ago
- ☆10Feb 28, 2023Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Playground for reinforcement learning algorithms implemented in TensorFlow☆16Oct 18, 2016Updated 9 years ago
- ☆17Mar 27, 2018Updated 8 years ago
- Course materials for a 3-day seminar "Machine Learning and NLP: Advances and Applications" at New College of Florida☆12Feb 10, 2022Updated 4 years ago
- ☆51Nov 20, 2025Updated 6 months ago
- AI Agents for Semi-Autonomous Public Goods Production☆12May 20, 2024Updated 2 years ago
- 基于图注意力模型(GAT)的交通网络流量预测☆16Apr 16, 2022Updated 4 years ago
- Code for ICLR 2019 paper: Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks☆232Oct 3, 2023Updated 2 years ago
- Deep Reinforcement Learning by using Phasic Policy Gradient in Pytorch & Tensorflow☆20Oct 5, 2021Updated 4 years ago
- Scaling scaling laws with board games.☆53Jul 17, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code used to train an augmented auto-encoder (aka denoising auto-encoder with more augmentations) for the DonkeyCar simulator.☆47Jun 19, 2022Updated 3 years ago
- DNN Node Collection using Inference Helper in ROS2☆13Apr 24, 2022Updated 4 years ago
- ☆28Dec 29, 2025Updated 4 months ago
- Interactive and dynamic painting simulation in WebGL☆13May 2, 2024Updated 2 years ago
- Adds partial fit method to sklearn's forest estimators to allow incremental training without being limited to a linear model. Works with …☆37Jun 18, 2024Updated last year
- ☆13Dec 16, 2024Updated last year
- OMNI: Open-endedness via Models of human Notions of Interestingness☆65Jan 28, 2025Updated last year