Rainlabuw / rl-enabled-distributed-assignmentLinks
Implementation of RL-Enabled Distributed Assignment (REDA)
☆24Updated last year
Alternatives and similar repositories for rl-enabled-distributed-assignment
Users that are interested in rl-enabled-distributed-assignment are comparing it to the libraries listed below
Sorting:
- [IROS-2025] MAPF-GPT-DDG is a scalable decentralized multi-agent pathfinding (MAPF) solver based on imitation learning. It builds upon MA…☆60Updated 4 months ago
- Clean RL implementation using MLX☆33Updated last year
- How to create rational LLM-based agents? Using game-theoretic workflows!☆84Updated 5 months ago
- Model-Based Transfer Learning for Contextual Reinforcement Learning (NeurIPS 2024)☆25Updated 11 months ago
- ☆22Updated 9 months ago
- Repo to reproduce the First-Explore paper results☆38Updated 11 months ago
- ☆40Updated last year
- ☆25Updated 6 months ago
- Automated Capability Discovery via Foundation Model Self-Exploration☆65Updated 9 months ago
- An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆71Updated 2 years ago
- LMQL implementation of tree of thoughts☆34Updated last year
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆34Updated 5 months ago
- ☆41Updated 7 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆44Updated last year
- ☆88Updated 3 weeks ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆119Updated last week
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated 11 months ago
- rock paper scissors game using Double-DQN against different random generator☆17Updated 6 years ago
- A set of communication oriented environments☆20Updated 4 months ago
- ☆64Updated last week
- ☆44Updated last year
- a suite of finetuned LLMs for atomically precise function calling 🧪☆17Updated this week
- Simple GRPO scripts and configurations.☆59Updated 9 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆23Updated last year
- The original Shared Recurrent Memory Transformer implementation☆33Updated 4 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆65Updated 9 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 7 months ago
- LLM reads a paper and produce a working prototype☆58Updated 7 months ago
- ☆43Updated 2 weeks ago
- ☆40Updated 11 months ago