Rainlabuw / rl-enabled-distributed-assignmentLinks
Implementation of RL-Enabled Distributed Assignment (REDA)
☆24Updated last year
Alternatives and similar repositories for rl-enabled-distributed-assignment
Users that are interested in rl-enabled-distributed-assignment are comparing it to the libraries listed below
Sorting:
- [IROS-2025] MAPF-GPT-DDG is a scalable decentralized multi-agent pathfinding (MAPF) solver based on imitation learning. It builds upon MA…☆60Updated 4 months ago
- Clean RL implementation using MLX☆34Updated last year
- ☆22Updated 10 months ago
- ☆25Updated 6 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 8 months ago
- Automated Capability Discovery via Foundation Model Self-Exploration☆65Updated 10 months ago
- GBRL-based Actor-Critic algorithms implemented in stable-baselines3☆41Updated this week
- ☆92Updated last month
- Repo to reproduce the First-Explore paper results☆38Updated 11 months ago
- The original Shared Recurrent Memory Transformer implementation☆33Updated 5 months ago
- LLM reads a paper and produce a working prototype☆60Updated 8 months ago
- [EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality☆19Updated 9 months ago
- ☆40Updated last year
- Model-Based Transfer Learning for Contextual Reinforcement Learning (NeurIPS 2024)☆26Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆45Updated last year
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆23Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- How to create rational LLM-based agents? Using game-theoretic workflows!☆86Updated 6 months ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆120Updated last month
- ☆77Updated 2 months ago
- An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆72Updated 2 years ago
- A framework making it effortless to convert any llm model into a reasoning agent like o1 or DeepSeek's r1☆22Updated 2 months ago
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆59Updated this week
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆17Updated 2 months ago
- a WIP architecture designed to allow transformers to think in a manner without tokens☆20Updated last year
- ☆42Updated 7 months ago
- ☆27Updated last year
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆89Updated last week
- Benchmarks for Multi-Objective Multi-Agent Decision Making☆113Updated 2 months ago
- An introduction to DSPy☆32Updated 3 months ago