[ICLR'26] Stronger-MAS: A RL Framework for multi LLM agent system
☆147Apr 6, 2026Updated this week
Alternatives and similar repositories for PettingLLMs
Users that are interested in PettingLLMs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning☆61Dec 18, 2025Updated 3 months ago
- OrcaLoca: An LLM Agent Framework for Software Issue Localization [ICML 25]☆40Apr 7, 2025Updated last year
- (ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning"☆26Mar 2, 2026Updated last month
- ☆17Nov 3, 2024Updated last year
- Official Repo for FoodieQA paper (EMNLP 2024)☆20Jun 26, 2025Updated 9 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated 10 months ago
- daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently☆37Feb 4, 2026Updated 2 months ago
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"☆75Sep 13, 2025Updated 6 months ago
- ☆37Dec 16, 2025Updated 3 months ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".☆83Jan 14, 2025Updated last year
- [ICLR 2025] On Evluating the Durability of Safegurads for Open-Weight LLMs☆13Jun 20, 2025Updated 9 months ago
- EgoToM is an egocentric theory-of-mind benchmark built on Ego4D videos, containing multi-choice questions that evaluate multimodal large …☆14Apr 1, 2025Updated last year
- Reinforced Multi-LLM Agents training☆80Jan 18, 2026Updated 2 months ago
- Under construction☆13Jan 15, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆13May 13, 2025Updated 10 months ago
- ☆24Jun 5, 2025Updated 10 months ago
- Official Implementation of wd1☆25Sep 25, 2025Updated 6 months ago
- ☆35May 24, 2025Updated 10 months ago
- ☆14Nov 19, 2024Updated last year
- Efficient Long-context Language Model Training by Core Attention Disaggregation☆97Updated this week
- ☆54Feb 19, 2025Updated last year
- PyTorch implementation for NAACL 2022 paper: "Document-Level Relation Extraction with Sentences Importance Estimation and Focusing"☆17Apr 29, 2022Updated 3 years ago
- This repository contains reference implementation for multi-LLM ToM paper (accepted to EMNLP 2023), Theory of Mind for Multi-Agent Collab…☆18Jun 11, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Repo for EmbedLLM: Learning Compact Representations of Large Language Models☆29Sep 25, 2025Updated 6 months ago
- ☆14Mar 15, 2022Updated 4 years ago
- ☆14Jun 3, 2025Updated 10 months ago
- ☆29Mar 25, 2026Updated 2 weeks ago
- ☆11Aug 20, 2025Updated 7 months ago
- MuJoCo benchmark for Deep Reinforcement Learning as provided by Tianshou framework.☆15Jan 12, 2025Updated last year
- ☆83Dec 5, 2024Updated last year
- Code and dataset of CodeSteer☆90Mar 26, 2025Updated last year
- Project page for the NeurIPS 2024 paper, Language Grounded Multi-agent Reinforcement Learning with Human-interpretable Communication.☆17Dec 6, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 基于InternLm chat 7B大模型基座,构建一个Agent ,可以调用 MMYOLO 工具来完成图像内视觉任务☆11Oct 30, 2024Updated last year
- A Python reimplementation + extension of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)☆18Dec 1, 2023Updated 2 years ago
- Code for GeSS: Benchmarking Geometric Deep Learning under Scientific Applications with Distribution Shifts☆16Dec 28, 2024Updated last year
- Predict binding affinity of ligand-protein complexes using Graph Neural Networks. The model is implemented using PyTorch Geometric and ba…☆11Nov 26, 2022Updated 3 years ago
- [ICML 2025] Official implementation of the paper "SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling". …☆21Nov 17, 2025Updated 4 months ago
- The official implementation of the paper "Mem-α: Learning Memory Construction via Reinforcement Learning"☆193Dec 25, 2025Updated 3 months ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆479Feb 19, 2026Updated last month