Rainlabuw / rl-enabled-distributed-assignmentLinks
Implementation of RL-Enabled Distributed Assignment (REDA)
☆20Updated last year
Alternatives and similar repositories for rl-enabled-distributed-assignment
Users that are interested in rl-enabled-distributed-assignment are comparing it to the libraries listed below
Sorting:
- ☆22Updated last month
- Clean RL implementation using MLX☆32Updated last year
- Repo to reproduce the First-Explore paper results☆37Updated 6 months ago
- Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module Theory of Mind …☆31Updated last year
- Automated Capability Discovery via Foundation Model Self-Exploration☆57Updated 5 months ago
- Simple repository for training small reasoning models☆33Updated 5 months ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆23Updated 3 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆60Updated 4 months ago
- LLM reads a paper and produce a working prototype☆58Updated 3 months ago
- Model-Based Transfer Learning for Contextual Reinforcement Learning (NeurIPS 2024)☆24Updated 7 months ago
- [IROS-2025] MAPF-GPT-DDG is a scalable decentralized multi-agent pathfinding (MAPF) solver based on imitation learning. It builds upon MA…☆55Updated 2 weeks ago
- The original Shared Recurrent Memory Transformer implementation☆27Updated last week
- How to create rational LLM-based agents? Using game-theoretic workflows!☆72Updated last month
- Causal Agent based on Large Language Model☆47Updated 3 weeks ago
- GBRL-based Actor-Critic algorithms implemented in stable-baselines3☆35Updated 2 weeks ago
- An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆69Updated 2 years ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆24Updated 3 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆32Updated 3 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆17Updated last year
- ☆40Updated last year
- ☆41Updated last week
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated last year
- INTeractive learning via REPresentatIon Discovery☆34Updated last year
- Simple GRPO scripts and configurations.☆59Updated 5 months ago
- Benchmarks for Multi-Objective Multi-Agent Decision Making☆95Updated 3 weeks ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆27Updated 3 weeks ago
- Generative cellular automaton-like learning environments for RL.☆19Updated 5 months ago
- Learn online intrinsic rewards from LLM feedback☆41Updated 7 months ago
- ☆40Updated 7 months ago
- [EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality☆19Updated 4 months ago