Rainlabuw / rl-enabled-distributed-assignment
Implementation of RL-Enabled Distributed Assignment (REDA)
☆27 Updated last year
Alternatives and similar repositories for rl-enabled-distributed-assignment
Users that are interested in rl-enabled-distributed-assignment are comparing it to the libraries listed below
- [IROS-2025] MAPF-GPT-DDG is a scalable decentralized multi-agent pathfinding (MAPF) solver based on imitation learning. It builds upon MA… ☆60 Updated 3 weeks ago
- Clean RL implementation using MLX ☆34 Updated last year
- Automated Capability Discovery via Foundation Model Self-Exploration ☆66 Updated 11 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆34 Updated 9 months ago
- The original Shared Recurrent Memory Transformer implementation ☆33 Updated 6 months ago
- How to create rational LLM-based agents? Using game-theoretic workflows! ☆92 Updated 8 months ago
- ☆25 Updated 8 months ago
- Model-Based Transfer Learning for Contextual Reinforcement Learning (NeurIPS 2024) ☆26 Updated 2 weeks ago
- ☆22 Updated 11 months ago
- ☆39 Updated last year
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models ☆66 Updated 11 months ago
- ☆75 Updated last week
- ☆55 Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖 ☆76 Updated last year
- ☆141 Updated 4 months ago
- An OpenAI gym environment to evaluate the ability of LLMs (e.g. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult… ☆73 Updated 2 years ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning. ☆130 Updated 2 months ago
- Repo to reproduce the First-Explore paper results ☆39 Updated last year
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ… ☆35 Updated 7 months ago
- ☆56 Updated last year
- LLM reads a paper and produces a working prototype ☆60 Updated 9 months ago
- ☆40 Updated last year
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles ☆61 Updated 9 months ago
- ☆27 Updated last year
- An intelligent code optimization system leveraging AI analysis, automated refactoring, and test generation. Built with DSPy and Gradio, i… ☆20 Updated last year
- General multi-task deep RL Agent ☆185 Updated last year
- OpenPipe Reinforcement Learning Experiments ☆32 Updated 10 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta ☆13 Updated last year
- ☆43 Updated 3 months ago
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience" ☆62 Updated 10 months ago