Rainlabuw / rl-enabled-distributed-assignmentLinks
Implementation of RL-Enabled Distributed Assignment (REDA)
☆25Updated last year
Alternatives and similar repositories for rl-enabled-distributed-assignment
Users that are interested in rl-enabled-distributed-assignment are comparing it to the libraries listed below
Sorting:
- [IROS-2025] MAPF-GPT-DDG is a scalable decentralized multi-agent pathfinding (MAPF) solver based on imitation learning. It builds upon MA…☆60Updated this week
- Clean RL implementation using MLX☆34Updated last year
- GBRL-based Actor-Critic algorithms implemented in stable-baselines3☆42Updated 2 weeks ago
- ☆25Updated 7 months ago
- How to create rational LLM-based agents? Using game-theoretic workflows!☆89Updated 7 months ago
- Model-Based Transfer Learning for Contextual Reinforcement Learning (NeurIPS 2024)☆26Updated last year
- Repo to reproduce the First-Explore paper results☆38Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 9 months ago
- ☆22Updated 11 months ago
- Automated Capability Discovery via Foundation Model Self-Exploration☆66Updated 11 months ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆124Updated 2 months ago
- The original Shared Recurrent Memory Transformer implementation☆33Updated 6 months ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆35Updated 6 months ago
- ☆28Updated 9 months ago
- ☆39Updated last year
- Universal Reasoning Model☆119Updated this week
- Official repository of the 2025 paper, LLM Economist: Large Population Models and Mechanism Design in Multi-Agent Generative Simulacra.☆62Updated this week
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆36Updated last year
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆65Updated 10 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆24Updated last year
- ☆73Updated this week
- TaskMet Task-driven Metric Learning for Model Learning☆20Updated last year
- Benchmarks for Multi-Objective Multi-Agent Decision Making☆117Updated 3 months ago
- A set of communication oriented environments☆25Updated 6 months ago
- Code for [ICML2025]``Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design``.☆67Updated 7 months ago
- ☆94Updated this week
- a suite of finetuned LLMs for atomically precise function calling 🧪☆17Updated this week
- Simple GRPO scripts and configurations.☆59Updated 11 months ago
- ☆128Updated last year
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆26Updated last month