Rainlabuw / rl-enabled-distributed-assignmentLinks
Implementation of RL-Enabled Distributed Assignment (REDA)
☆22Updated last year
Alternatives and similar repositories for rl-enabled-distributed-assignment
Users that are interested in rl-enabled-distributed-assignment are comparing it to the libraries listed below
Sorting:
- Clean RL implementation using MLX☆32Updated last year
- [IROS-2025] MAPF-GPT-DDG is a scalable decentralized multi-agent pathfinding (MAPF) solver based on imitation learning. It builds upon MA…☆58Updated 2 months ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆114Updated 2 months ago
- How to create rational LLM-based agents? Using game-theoretic workflows!☆78Updated 4 months ago
- ☆25Updated 4 months ago
- Automated Capability Discovery via Foundation Model Self-Exploration☆64Updated 8 months ago
- Simple GRPO scripts and configurations.☆59Updated 8 months ago
- LLM reads a paper and produce a working prototype☆57Updated 6 months ago
- ☆21Updated 8 months ago
- Repo to reproduce the First-Explore paper results☆38Updated 9 months ago
- The original Shared Recurrent Memory Transformer implementation☆31Updated 3 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆44Updated last year
- ☆56Updated 3 weeks ago
- ☆40Updated last year
- Automatic Prompt Optimization☆45Updated last year
- ☆40Updated 10 months ago
- Simple repository for training small reasoning models☆40Updated 8 months ago
- An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆71Updated 2 years ago
- ☆61Updated 2 weeks ago
- ☆42Updated last year
- A repository re-creating the PromptBreeder Evolutionary Algorithm from the DeepMind Paper in Python using LMQL as the backend.☆28Updated last year
- GBRL-based Actor-Critic algorithms implemented in stable-baselines3☆38Updated 2 months ago
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆61Updated 6 months ago
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊☆131Updated last week
- Code for [ICML2025]``Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design``.☆60Updated 4 months ago
- A set of communication oriented environments☆18Updated 3 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆33Updated 5 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆20Updated last year
- Model-Based Transfer Learning for Contextual Reinforcement Learning (NeurIPS 2024)☆25Updated 10 months ago
- Intrinsic Motivation from Artificial Intelligence Feedback☆131Updated last year