Rainlabuw / rl-enabled-distributed-assignment
Implementation of RL-Enabled Distributed Assignment (REDA)
☆22Updated last year
Alternatives and similar repositories for rl-enabled-distributed-assignment
Users interested in rl-enabled-distributed-assignment are comparing it to the libraries listed below
- ☆24Updated 3 months ago
- Automated Capability Discovery via Foundation Model Self-Exploration☆63Updated 6 months ago
- [IROS-2025] MAPF-GPT-DDG is a scalable decentralized multi-agent pathfinding (MAPF) solver based on imitation learning. It builds upon MA…☆58Updated last month
- Clean RL implementation using MLX☆32Updated last year
- A set of communication oriented environments☆14Updated last month
- An OpenAI gym environment to evaluate the ability of LLMs (e.g. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆70Updated 2 years ago
- Repo to reproduce the First-Explore paper results☆38Updated 8 months ago
- Model-Based Transfer Learning for Contextual Reinforcement Learning (NeurIPS 2024)☆25Updated 8 months ago
- Benchmarks for Multi-Objective Multi-Agent Decision Making☆100Updated last week
- Simple GRPO scripts and configurations.☆59Updated 6 months ago
- An intelligent code optimization system leveraging AI analysis, automated refactoring, and test generation. Built with DSPy and Gradio, i…☆20Updated 7 months ago
- LLM reads a paper and produces a working prototype☆57Updated 4 months ago
- ATLAS is a sophisticated real-time risk analysis system designed for institutional-grade market risk assessment. Built with high-frequenc…☆12Updated 7 months ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆106Updated 3 weeks ago
- GBRL-based Actor-Critic algorithms implemented in stable-baselines3☆38Updated 2 weeks ago
- Causal Agent based on Large Language Model☆50Updated 2 months ago
- ☆44Updated last month
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆18Updated last year
- How to create rational LLM-based agents? Using game-theoretic workflows!☆75Updated 2 months ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆23Updated last week
- Code for [ICML2025]``Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design``.☆54Updated 3 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆63Updated 6 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆33Updated 4 months ago
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025).☆67Updated 8 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆75Updated 8 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆88Updated this week
- OMNI: Open-endedness via Models of human Notions of Interestingness☆55Updated 7 months ago
- LMQL implementation of tree of thoughts☆34Updated last year
- Intrinsic Motivation from Artificial Intelligence Feedback☆131Updated last year
- Simple repository for training small reasoning models☆37Updated 6 months ago