MozerWang/AMPO

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MozerWang/AMPO)

MozerWang / AMPO

[ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents

☆51

Alternatives and similar repositories for AMPO

Users that are interested in AMPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MozerWang / DEMO
View on GitHub
[ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling
☆22Dec 16, 2024Updated last year
MozerWang / promISe
View on GitHub
[COLING 2024 (Oral)] PromISe:Releasing the Capabilities of LLMs with Prompt Introspective Search
☆23Aug 26, 2024Updated last year
MozerWang / Loong
View on GitHub
[EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA
☆155Dec 22, 2025Updated 7 months ago
rhyang2021 / ARIA
View on GitHub
Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".
☆30Aug 9, 2025Updated 11 months ago
RainBowLuoCS / MMEvol
View on GitHub
(ACL 2025) 🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct"
☆22May 15, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
RainBowLuoCS / DEEM
View on GitHub
(ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.
☆51Jul 1, 2025Updated last year
Trae1ounG / BuPO
View on GitHub
[arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies
☆60Feb 6, 2026Updated 5 months ago
maitrix-org / dynamic-alignment-optimization
View on GitHub
[EMNLP'24 (Main)] DRPO(Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-…
☆24Nov 17, 2024Updated last year
Tongyi-CCAI / Complex-IF
View on GitHub
☆34Jan 26, 2026Updated 5 months ago
ZNLP / Language-Imbalance-Driven-Rewarding
View on GitHub
[ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving
☆25Apr 6, 2026Updated 3 months ago
TheNewBeeKing / MemPO
View on GitHub
The official repository of paper: MemPO: Self-Memory Policy Optimization for Long-Horizon Agents
☆24Apr 10, 2026Updated 3 months ago
apple / ml-mia-bench
View on GitHub
This repo contains code and data for ICLR 2025 paper MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs
☆38Mar 9, 2025Updated last year
October2001 / ProLong
View on GitHub
[ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models
☆61Jul 23, 2024Updated 2 years ago
ybwang119 / label_recovery
View on GitHub
[ICLR 2024] Towards Elminating Hard Label Constraints in Gradient Inverision Attacks
☆14Feb 6, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
ljcleo / agent_sense
View on GitHub
Benchmarking Social Intelligence of Language Agents through Interactive Scenarios
☆13Jan 4, 2025Updated last year
AIGeeksGroup / PresentAgent-2
View on GitHub
PresentAgent-2: Towards Generalist Multimodal Presentation Agents
☆17Jun 5, 2026Updated last month
TEAM-ARM / arm
View on GitHub
[NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model
☆68Apr 6, 2026Updated 3 months ago
chang-github-00 / Predictive-Decoding
View on GitHub
Repo for Anonymous purpose, pls don't distribute
☆10Oct 2, 2024Updated last year
sotopia-lab / sotopia-rl
View on GitHub
Sotopia-RL: Reward Design for Social Intelligence
☆52Apr 1, 2026Updated 3 months ago
chang-github-00 / LLM-Predictive-Decoding
View on GitHub
☆16Jul 9, 2025Updated last year
QizhiPei / MathFusion
View on GitHub
MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion (ACL 2025)
☆37Jul 16, 2025Updated last year
tml1026 / RoleCraft
View on GitHub
☆21Feb 15, 2024Updated 2 years ago
CLUEbenchmark / Math24o
View on GitHub
Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark
☆14Mar 27, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ritzz-ai / PACS
View on GitHub
☆31Sep 12, 2025Updated 10 months ago
RainBowLuoCS / OpenOmni
View on GitHub
(NIPS 2025) OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Align…
☆142May 9, 2026Updated 2 months ago
CLR-Lab / SimKO
View on GitHub
SimKO: Simple Pass@K Policy Optimization
☆31Oct 24, 2025Updated 9 months ago
IPBench / IPBench
View on GitHub
[ACL 2026] Repository of IPBench
☆23Apr 6, 2026Updated 3 months ago
rhyang2021 / CogRouter
View on GitHub
Source code for our paper: "Think Fast and Slow: Step-Level Cognitive Depth Adaptation for LLM Agents".
☆24Feb 20, 2026Updated 5 months ago
zwhong714 / weak-to-strong-preference-optimization
View on GitHub
[ICLR 2025 Spotlight] Weak-to-strong preference optimization: stealing reward from weak aligned model
☆18Feb 24, 2025Updated last year
AIRobotZhang / SCDL
View on GitHub
Improving Distantly-Supervised Named Entity Recognition with Self-Collaborative Denoising Learning
☆14Apr 11, 2022Updated 4 years ago
open-compass / RePro
View on GitHub
[ICLR 2026] Rectifying LLM Thought From Lens of Optimization
☆15Dec 5, 2025Updated 7 months ago
II-Bench / II-Bench
View on GitHub
☆28Oct 28, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
VityaVitalich / STASC
View on GitHub
[ICLR 2025 SSI-FM] Self-Taught Self-Correction for Small Language Models
☆11Sep 19, 2025Updated 10 months ago
uservan / speculative_thinking
View on GitHub
☆34Oct 13, 2025Updated 9 months ago
DocTron-hub / OCRVerse
View on GitHub
OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models
☆30Feb 4, 2026Updated 5 months ago
GraphPKU / number_cookbook
View on GitHub
Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It.
☆21Mar 31, 2025Updated last year
zmzhang2000 / trustworthy-alignment
View on GitHub
Official repository for Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning
☆12Sep 2, 2024Updated last year
linkedin / ControlLLM
View on GitHub
Control LLM
☆23Apr 6, 2025Updated last year
BraveGroup / PointSAM-for-MixSup
View on GitHub
[ICLR 2024] MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection
☆75Jul 10, 2024Updated 2 years ago