Rainlabuw / rl-enabled-distributed-assignmentLinks
Implementation of RL-Enabled Distributed Assignment (REDA)
☆22Updated last year
Alternatives and similar repositories for rl-enabled-distributed-assignment
Users that are interested in rl-enabled-distributed-assignment are comparing it to the libraries listed below
Sorting:
- [IROS-2025] MAPF-GPT-DDG is a scalable decentralized multi-agent pathfinding (MAPF) solver based on imitation learning. It builds upon MA…☆57Updated last month
- ☆25Updated 3 months ago
- Clean RL implementation using MLX☆33Updated last year
- Repo to reproduce the First-Explore paper results☆38Updated 9 months ago
- Code for [ICML2025]``Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design``.☆55Updated 4 months ago
- Causal Agent based on Large Language Model☆51Updated 2 weeks ago
- Automated Capability Discovery via Foundation Model Self-Exploration☆64Updated 7 months ago
- The official repository of ALE-Bench☆114Updated this week
- How to create rational LLM-based agents? Using game-theoretic workflows!☆74Updated 3 months ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆111Updated last month
- LLM reads a paper and produce a working prototype☆56Updated 5 months ago
- Model-Based Transfer Learning for Contextual Reinforcement Learning (NeurIPS 2024)☆25Updated 9 months ago
- GBRL-based Actor-Critic algorithms implemented in stable-baselines3☆38Updated last month
- ☆11Updated last year
- ☆21Updated 7 months ago
- Benchmarks for Multi-Objective Multi-Agent Decision Making☆105Updated this week
- An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆70Updated 2 years ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆33Updated 5 months ago
- ☆27Updated 5 months ago
- ☆45Updated 4 months ago
- Automatic Prompt Optimization☆44Updated last year
- ☆68Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆75Updated 9 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆64Updated 7 months ago
- Simple GRPO scripts and configurations.☆59Updated 7 months ago
- ☆52Updated this week
- ☆39Updated last year
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆32Updated 3 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆20Updated last year
- Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module Theory of Mind …☆35Updated last year