[ICLR'26] Stronger-MAS: An RL Framework for Multi-LLM Agent Systems
☆157Apr 23, 2026Updated last week
Alternatives and similar repositories for PettingLLMs
Users that are interested in PettingLLMs are comparing it to the libraries listed below.
- OrcaLoca: An LLM Agent Framework for Software Issue Localization [ICML 25]☆42Apr 7, 2025Updated last year
- (ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning"☆26Mar 2, 2026Updated last month
- ☆25Nov 20, 2025Updated 5 months ago
- ☆17Nov 3, 2024Updated last year
- Official Repo for FoodieQA paper (EMNLP 2024)☆20Jun 26, 2025Updated 10 months ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated 11 months ago
- Implementation of a recommender system using RQ-VAE Semantic IDs☆16Apr 15, 2026Updated 2 weeks ago
- ☆30Jun 5, 2025Updated 10 months ago
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"☆78Sep 13, 2025Updated 7 months ago
- ☆54Feb 19, 2025Updated last year
- ☆16Mar 20, 2025Updated last year
- [ICLR 2025] On Evaluating the Durability of Safeguards for Open-Weight LLMs☆13Jun 20, 2025Updated 10 months ago
- Reinforced Multi-LLM Agents training☆82Jan 18, 2026Updated 3 months ago
- VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments☆22Sep 30, 2025Updated 7 months ago
- Under construction☆13Jan 15, 2025Updated last year
- ☆13May 13, 2025Updated 11 months ago
- Documenting large text datasets 🖼️ 📚☆14Dec 17, 2024Updated last year
- ☆23Sep 19, 2024Updated last year
- ☆25Jun 5, 2025Updated 10 months ago
- Official Implementation of wd1☆28Sep 25, 2025Updated 7 months ago
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025)☆23Jun 2, 2025Updated 10 months ago
- ☆35May 24, 2025Updated 11 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆161Oct 30, 2024Updated last year
- Chat language model that can interpret and execute functions/plugins☆14Oct 16, 2024Updated last year
- Efficient Long-context Language Model Training by Core Attention Disaggregation☆98Apr 7, 2026Updated 3 weeks ago
- Repo for EmbedLLM: Learning Compact Representations of Large Language Models☆29Sep 25, 2025Updated 7 months ago
- ☆14Jun 3, 2025Updated 10 months ago
- ☆31Apr 12, 2026Updated 2 weeks ago
- ☆88Dec 5, 2024Updated last year
- ☆11Aug 20, 2025Updated 8 months ago
- MuJoCo benchmark for Deep Reinforcement Learning as provided by Tianshou framework.☆15Jan 12, 2025Updated last year
- something for paper agent☆11Dec 18, 2024Updated last year
- Project page for the NeurIPS 2024 paper, Language Grounded Multi-agent Reinforcement Learning with Human-interpretable Communication.☆16Dec 6, 2024Updated last year
- A Python reimplementation + extension of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)☆18Dec 1, 2023Updated 2 years ago
- [ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆383Mar 30, 2026Updated last month
- [EMNLP 2024 Main] Official implementation of the paper "The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Langua…☆13Nov 11, 2024Updated last year
- [ICML 2025] Official implementation of the paper "SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling". …☆22Nov 17, 2025Updated 5 months ago
- FLOPS counter for all your GPU benchmarking needs☆13Aug 8, 2024Updated last year
- The official implementation of the paper "Mem-α: Learning Memory Construction via Reinforcement Learning"☆199Dec 25, 2025Updated 4 months ago