Rainlabuw / rl-enabled-distributed-assignmentLinks
Implementation of RL-Enabled Distributed Assignment (REDA)
☆23Updated last year
Alternatives and similar repositories for rl-enabled-distributed-assignment
Users that are interested in rl-enabled-distributed-assignment are comparing it to the libraries listed below
Sorting:
- [IROS-2025] MAPF-GPT-DDG is a scalable decentralized multi-agent pathfinding (MAPF) solver based on imitation learning. It builds upon MA…☆56Updated last week
- Repo to reproduce the First-Explore paper results☆38Updated 7 months ago
- Clean RL implementation using MLX☆32Updated last year
- An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆69Updated 2 years ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆17Updated last year
- Automated Capability Discovery via Foundation Model Self-Exploration☆59Updated 5 months ago
- How to create rational LLM-based agents? Using game-theoretic workflows!☆73Updated 2 months ago
- LLM reads a paper and produce a working prototype☆58Updated 3 months ago
- Simple repository for training small reasoning models☆32Updated 6 months ago
- ☆23Updated 2 months ago
- ☆42Updated last month
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 3 months ago
- Model-Based Transfer Learning for Contextual Reinforcement Learning (NeurIPS 2024)☆24Updated 7 months ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆24Updated 4 months ago
- ☆21Updated 5 months ago
- ☆38Updated last year
- Public repository containing METR's DVC pipeline for eval data analysis☆91Updated 4 months ago
- ☆40Updated last year
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆31Updated last month
- Simple GRPO scripts and configurations.☆59Updated 6 months ago
- Causal Agent based on Large Language Model☆49Updated last month
- a WIP architecture designed to allow transformers to think in a manner without tokens☆20Updated last year
- ☆54Updated last year
- ☆66Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆72Updated 8 months ago
- Neuroevolution Benchmark in JAX 🦕☆39Updated last year
- A set of communication oriented environments☆12Updated 3 weeks ago
- The original Shared Recurrent Memory Transformer implementation☆30Updated last month
- LMQL implementation of tree of thoughts☆34Updated last year
- Code for [ICML2025]``Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design``.☆49Updated 2 months ago