[ICLR'26] Stronger-MAS: A RL Framework for multi LLM agent system
☆125Mar 18, 2026Updated this week
Alternatives and similar repositories for PettingLLMs
Users that are interested in PettingLLMs are comparing it to the libraries listed below
Sorting:
- OrcaLoca: An LLM Agent Framework for Software Issue Localization [ICML 25]☆39Apr 7, 2025Updated 11 months ago
- (ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning"☆26Mar 2, 2026Updated 2 weeks ago
- ☆17Nov 3, 2024Updated last year
- ☆15Aug 4, 2025Updated 7 months ago
- The Code for the EMNLP 2023 main conference paper "Prompt-based Logical Semantics Enhancement for Implicit Discourse Relation Recognition…☆13Dec 10, 2023Updated 2 years ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated 10 months ago
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"☆71Sep 13, 2025Updated 6 months ago
- Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning☆60Dec 18, 2025Updated 3 months ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".☆83Jan 14, 2025Updated last year
- EgoToM is an egocentric theory-of-mind benchmark built on Ego4D videos, containing multi-choice questions that evaluate multimodal large …☆13Apr 1, 2025Updated 11 months ago
- Reinforced Multi-LLM Agents training☆76Jan 18, 2026Updated 2 months ago
- ☆13May 13, 2025Updated 10 months ago
- Documenting large text datasets 🖼️ 📚☆14Dec 17, 2024Updated last year
- Official Implementation of wd1☆24Sep 25, 2025Updated 5 months ago
- ☆23Nov 20, 2025Updated 4 months ago
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025)☆21Jun 2, 2025Updated 9 months ago
- ☆34May 24, 2025Updated 9 months ago
- ☆42Jan 6, 2026Updated 2 months ago
- Automaton & Cognition☆16Apr 14, 2024Updated last year
- ☆14Nov 19, 2024Updated last year
- A list of articles outside of the official MLIR docs that I've found useful for learning MLIR☆11Aug 16, 2023Updated 2 years ago
- Efficient Long-context Language Model Training by Core Attention Disaggregation☆96Mar 5, 2026Updated 2 weeks ago
- ☆53Feb 19, 2025Updated last year
- This repository contains reference implementation for multi-LLM ToM paper (accepted to EMNLP 2023), Theory of Mind for Multi-Agent Collab…☆18Jun 11, 2024Updated last year
- ☆13Mar 15, 2022Updated 4 years ago
- MuJoCo benchmark for Deep Reinforcement Learning as provided by Tianshou framework.☆15Jan 12, 2025Updated last year
- ☆29Mar 13, 2026Updated last week
- ☆10Sep 16, 2021Updated 4 years ago
- Project page for the NeurIPS 2024 paper, Language Grounded Multi-agent Reinforcement Learning with Human-interpretable Communication.☆17Dec 6, 2024Updated last year
- Predict binding affinity of ligand-protein complexes using Graph Neural Networks. The model is implemented using PyTorch Geometric and ba…☆11Nov 26, 2022Updated 3 years ago
- Code for GeSS: Benchmarking Geometric Deep Learning under Scientific Applications with Distribution Shifts☆16Dec 28, 2024Updated last year
- The official implementation of the paper "Mem-α: Learning Memory Construction via Reinforcement Learning"☆189Dec 25, 2025Updated 2 months ago
- FLOPS counter for all your GPU benchmarking needs☆13Aug 8, 2024Updated last year
- ☆28Oct 2, 2025Updated 5 months ago
- ☆22May 5, 2025Updated 10 months ago
- [NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models☆17Jul 17, 2024Updated last year
- Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF☆24Oct 8, 2024Updated last year
- Code release for "Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search" published at NeurIPS '24.☆17Feb 21, 2025Updated last year
- Exploring Reinforcement Learning Solutions to the Vehicle Routing Problem. PPO, A2C, DQN, SAC☆23Sep 8, 2023Updated 2 years ago