cgrivera/ai-arena

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/cgrivera/ai-arena)

cgrivera / ai-arena

The AI Arena: A framework for distributed multi-agent reinforcement learning

☆14

Alternatives and similar repositories for ai-arena

Users that are interested in ai-arena are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ying-wen / gr2
View on GitHub
Appendix and Code for Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning
☆14Dec 8, 2022Updated 3 years ago
zoeyuchao / MPE-pytorch
View on GitHub
This is MPE-pytorch, fix some bugs.
☆11Apr 26, 2020Updated 6 years ago
Amanda2024 / CARE-SMAC-MA_SAC
View on GitHub
Multi-task Multi-agent Soft Actor Critic for SMAC
☆15Jan 18, 2022Updated 4 years ago
andry91 / Max_Sum_Python
View on GitHub
MaxSum is an algorithm about Distributed Constraint Optimization Problems (DCOPs)
☆11Jan 15, 2018Updated 8 years ago
YaoYuBJTU / Algorithms_for_solving_VRP
View on GitHub
Implementation of VRP solution algorithm in Python
☆10Apr 5, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
yeoneee / TS-SPMA
View on GitHub
TS_SPMA: The Tabu Search algorithm for simultaneous scheduling problem of machines and AGVs.
☆12Apr 30, 2021Updated 5 years ago
zcchenvy / CIL-DDQN
View on GitHub
code of paper 《Independent Reinforcement Learning for Weakly Cooperative Multiagent Traffic Control Problem》
☆17Dec 14, 2020Updated 5 years ago
tania2333 / DQN_MADDPG_practice
View on GitHub
RL projects including implementation of DQN/DDPG/MADDPG/BicNet on StarCraft II multi-agent learning environment SMAC
☆46Feb 7, 2020Updated 6 years ago
harish-kamath / rqae
View on GitHub
Residual Quantization Autoencoder, used for interpreting LLMs
☆14Jan 1, 2025Updated last year
feli-s / algorithm-selector-for-AGV-scheduling
View on GitHub
This repository stores code about the JSSP and FJSSP scheduling problem solved with two constraint programming solvers: IBM CPLEX CP Opti…
☆15Dec 15, 2022Updated 3 years ago
Psi-Prod / ppx_system
View on GitHub
ppx_system is a syntax extension to known operating system at compile time
☆12May 9, 2023Updated 3 years ago
shivirity / agv_simulator
View on GitHub
simulator for agv scheduling system (Project 2023, SJTU)
☆14May 22, 2023Updated 3 years ago
kristychoi / pixel_exploration
View on GitHub
PyTorch implementation of Count-Based Exploration with Neural Density Models
☆10Mar 22, 2018Updated 8 years ago
julienroyd / coordination-marl
View on GitHub
Code to reproduce experiments from:
☆10Dec 11, 2020Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
CPUFronz / SC2_HarvesterAgent
View on GitHub
A StarCraft 2 agent for harvesting resources
☆13Jun 12, 2018Updated 8 years ago
frankroeder / lanro-gym
View on GitHub
OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning
☆14Jan 27, 2026Updated 5 months ago
Butanium / monte-carlo-tree-search-TSP
View on GitHub
Monte Carlo tree search for the travelling salesman problem (MCTS for the TSP)
☆12Jun 18, 2022Updated 4 years ago
ahjwang / messenger-emma
View on GitHub
Implements the Messenger environment and EMMA model.
☆25Jun 14, 2023Updated 3 years ago
sferes2 / modular_QD
View on GitHub
**Sferes2 module** A unifying modular framework for Quality-Diversity algorithms
☆22Nov 6, 2020Updated 5 years ago
young-geng / SimpleSAC
View on GitHub
A simple and easy to use implementation of the soft actor-critic algorithm.
☆15Sep 2, 2022Updated 3 years ago
LeKhacToan / control-multiple-agv
View on GitHub
Path finding, task scheduling for multiple agv robot
☆22Dec 9, 2022Updated 3 years ago
coolmoon327 / Online-Scheduling-for-Energy-Minimization-in-Wireless-Powered-Mobile-Edge-Computing
View on GitHub
Related paper: Online Scheduling for Energy Minimization in Wireless Powered Mobile Edge Computing
☆10Jan 5, 2023Updated 3 years ago
ThomasRochefortB / torch-gato
View on GitHub
Pytorch implementation of the Gato paper from Deepmind
☆12Feb 8, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
imasmitja / stalker
View on GitHub
This is a ROS repository to track an underwater target using a Particle Filter range-only method and the SparusII AUV
☆11Nov 27, 2024Updated last year
s3rvac / lemke-howson
View on GitHub
Implementation of the Lemke-Howson algorithm for finding MNE
☆15Nov 2, 2013Updated 12 years ago
M-J-Murray / SFAgents
View on GitHub
Various implementations of ConvNet reinforcement learning agents trained against Street Fighter using the MAMEToolkit
☆24Jan 10, 2020Updated 6 years ago
WentseChen / Soft-QMIX
View on GitHub
Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization
☆15Jul 3, 2024Updated 2 years ago
aerorobotics / neural-swarm
View on GitHub
Code related to the Neural-Swarm (ICRA 2020, Journal) papers
☆29Mar 23, 2022Updated 4 years ago
philippkiesling / stable-baselines3-contrib-maskable-recurrent-ppo
View on GitHub
Combination of Maskable PPO and Recurrent PPO based on the sb3-contrib repository
☆12Feb 22, 2023Updated 3 years ago
sadrach-cl / conf
View on GitHub
all conf for apps
☆15Apr 26, 2024Updated 2 years ago
Wnight963 / UAV_Optim_Pytorch
View on GitHub
☆10Apr 7, 2021Updated 5 years ago
c-cube / ocaml-avro
View on GitHub
[DEPRECATED (use avro-simple)] Runtime library and schema compiler for the Avro serialization format.
☆21Jul 7, 2026Updated last week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
seanjhardy / HyperLife
View on GitHub
A realtime multicellular organism evolution simulator with Verlet integration
☆12May 30, 2021Updated 5 years ago
FlickerNiko / SAC-QMIX
View on GitHub
Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning.
☆60May 20, 2022Updated 4 years ago
aws-deepracer / aws-deepracer-notebooks
View on GitHub
Provides a jailbreak experience of AWS DeepRacer, giving us more control over the training/simulation process and RL algorithm tuning
☆18Feb 17, 2023Updated 3 years ago
robostac / coders-strike-back-referee
View on GitHub
Brutaltester compatible referee for coders strike back
☆13Jun 1, 2026Updated last month
eczy / rl-drone-coverage
View on GitHub
This is an implementation of the paper Cooperative and Distributed Reinforcement Learning of Drones for Field Coverage by Huy Xuan Pham, …
☆20Jun 29, 2020Updated 6 years ago
Egiob / DiversityIsAllYouNeed-SB3
View on GitHub
Implementation of Diversity Is All You Need (DIAYN) on top of Stable Baselines 3.
☆13Jul 11, 2022Updated 4 years ago
lafeychine / scala-native-sfml
View on GitHub
Scala Native 3 bindings for SFML library
☆15Jul 9, 2023Updated 3 years ago