GuoqingWang1/IGPO

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/GuoqingWang1/IGPO)

GuoqingWang1 / IGPO

[ICLR 2026] Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents

☆120

Alternatives and similar repositories for IGPO

Users that are interested in IGPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MozerWang / AMPO
View on GitHub
[ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents
☆51Feb 2, 2026Updated 5 months ago
MozerWang / DEMO
View on GitHub
[ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling
☆22Dec 16, 2024Updated last year
tml1026 / RoleCraft
View on GitHub
☆21Feb 15, 2024Updated 2 years ago
CSQianDong / RLCF
View on GitHub
Repo. for RLCF.
☆15Apr 1, 2024Updated 2 years ago
NJUNLP / Hallu-PI
View on GitHub
The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed …
☆11Sep 27, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
rhyang2021 / ARIA
View on GitHub
Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".
☆30Aug 9, 2025Updated 10 months ago
SeanLeng1 / Reward-Calibration
View on GitHub
☆21Dec 14, 2024Updated last year
HITSZ-HLT / T2S-Augmentation
View on GitHub
Released code for「Target-to-Source Augmentation for Aspect Sentiment Triplet Extraction」in EMNLP2023.
☆13Mar 28, 2024Updated 2 years ago
tajwarfahim / paprika
View on GitHub
Official Code Release for "Training a Generally Curious Agent"
☆48May 18, 2025Updated last year
qhjqhj00 / MetaAgent
View on GitHub
MetaAgent: Toward Self-Evolving Agent via Tool Meta-Learning
☆47Sep 3, 2025Updated 10 months ago
Y-Sui / FiDeLiS
View on GitHub
Code for Paper ACL'25: FiDELIS: Faithful Reasoning of Large Language Model on Knowledge Graph Question Answering
☆22May 8, 2025Updated last year
suninghuang19 / mentor
View on GitHub
MENTOR is a highly efficient visual RL algorithm that excels in both simulation and real-world complex robotic learning tasks.
☆27Jul 9, 2025Updated 11 months ago
KwaiKEG / CogGPT
View on GitHub
Unleashing the Power of Cognitive Dynamics on Large Language Models
☆65Sep 24, 2024Updated last year
shivamag125 / EM_PT
View on GitHub
☆32Aug 21, 2025Updated 10 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
renatolfc / sched-rl-gym
View on GitHub
☆19Updated this week
daniel-merrick / Learning-from-Simulated-and-Unsupervised-Images-through-Adversarial-Training-SimGAN-PyTorch
View on GitHub
PyTorch implementation of 'Learning from Simulated and Unsupervised Images through Adversarial Training'
☆16Jun 16, 2020Updated 6 years ago
kiddyboots216 / FedRL
View on GitHub
Federated Reinforcement Learning
☆12Jun 20, 2019Updated 7 years ago
Universal-Control / ppt_learning
View on GitHub
A unified robotic manipulation learning framework
☆23Sep 4, 2025Updated 10 months ago
JasperVanDenBosch / fexpect
View on GitHub
extension for fabric to handle prompts through pexpect
☆44May 31, 2015Updated 11 years ago
tongxuluo / LeaP
View on GitHub
Code, Data and Model for Paper "Learning from Peers in Reasoning Models"
☆26May 13, 2025Updated last year
ahnjaewoo / FlashAdventure
View on GitHub
🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"
☆26Apr 26, 2026Updated 2 months ago
JiahaoChen1 / Calibration
View on GitHub
☆15Mar 20, 2023Updated 3 years ago
hyz317 / CHARM
View on GitHub
[SIGGRAPH Asia 2025] CHARM: Control-point-based 3D Anime Hairstyle Auto-Regressive Modeling
☆49Apr 17, 2026Updated 2 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
CLUEbenchmark / Math24o
View on GitHub
Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark
☆13Mar 27, 2025Updated last year
lfoppiano / material-parsers
View on GitHub
Material parsers and other tools, scripts Initially developed for Grobid Superconductor
☆14Feb 21, 2025Updated last year
PKU-RL / AdaRefiner
View on GitHub
AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)
☆19Aug 9, 2024Updated last year
rhklite / Parallel-PPO-PyTorch
View on GitHub
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
☆21May 26, 2021Updated 5 years ago
dataSnail / RSpapers
View on GitHub
papers about recommender system.
☆10May 18, 2021Updated 5 years ago
HansenHua / MFPO-INFOCOM24
View on GitHub
An online federated reinforcement learning algorithm published in INFOCOM2024
☆16Dec 1, 2024Updated last year
kumarahlad / TensorFlow-Examples
View on GitHub
TensorFlow Tutorial and Examples for Beginners with Latest APIs
☆23Jan 21, 2019Updated 7 years ago
hzc1208 / ANN2SNN_COS
View on GitHub
☆16Feb 10, 2023Updated 3 years ago
Table-R1 / Table-R1
View on GitHub
[EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"
☆32Jun 3, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Update-For-Integrated-Business-AI / CORU
View on GitHub
☆19Jul 7, 2025Updated last year
Shiy-Li / ARG-Designer
View on GitHub
[AAAI 2026 Oral] Automatic Multi-agent Communication Topology Design
☆47May 10, 2026Updated last month
THU-KEG / VerIF
View on GitHub
[EMNLP 2025] Verification Engineering for RL in Instruction Following
☆56Mar 30, 2026Updated 3 months ago
phosseini / GisPy
View on GitHub
GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/
☆13Jul 1, 2024Updated 2 years ago
fengbinzhu / Doc2SoarGraph
View on GitHub
The repo of the Doc2SoarGraph framework
☆10Sep 17, 2024Updated last year
franrruiz / vcd_divergence
View on GitHub
Code to minimize the Variational Contrastive Divergence (VCD)
☆30May 30, 2019Updated 7 years ago
chen700564 / causalFSED
View on GitHub
☆16Nov 19, 2021Updated 4 years ago