[ICLR 2026] Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents
☆61 · Apr 23, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for IGPO
Users that are interested in IGPO are comparing it to the libraries listed below.
- [ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents ☆50 · Feb 2, 2026 · Updated 3 months ago
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling ☆22 · Dec 16, 2024 · Updated last year
- ☆21 · Feb 15, 2024 · Updated 2 years ago
- The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed … ☆11 · Sep 27, 2024 · Updated last year
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation". ☆28 · Aug 9, 2025 · Updated 8 months ago
- ☆21 · Dec 14, 2024 · Updated last year
- [FPGA-2022] N3H-Core: Neuron-designed Neural Network Accelerator via FPGA-based Heterogeneous Computing Cores ☆11 · Dec 16, 2021 · Updated 4 years ago
- [NeurIPS 2024 Oral] Repository of the CMuST paper: "Get Rid of Isolation: A Continuous Multi-task Spatio-Temporal Learning Framework" ☆14 · Mar 12, 2025 · Updated last year
- [ICLR 2025] Large (Vision) Language Models are Unsupervised In-Context Learners ☆22 · Jun 6, 2025 · Updated 11 months ago
- MetaAgent: Toward Self-Evolving Agent via Tool Meta-Learning ☆45 · Sep 3, 2025 · Updated 8 months ago
- Code for Paper ACL'25: FiDELIS: Faithful Reasoning of Large Language Model on Knowledge Graph Question Answering ☆23 · May 8, 2025 · Updated last year
- MENTOR is a highly efficient visual RL algorithm that excels in both simulation and real-world complex robotic learning tasks. ☆27 · Jul 9, 2025 · Updated 9 months ago
- Unleashing the Power of Cognitive Dynamics on Large Language Models ☆64 · Sep 24, 2024 · Updated last year
- ☆32 · Aug 21, 2025 · Updated 8 months ago
- Federated Reinforcement Learning ☆12 · Jun 20, 2019 · Updated 6 years ago
- This is the official implementation of the paper "Generative Retrieval with Semantic Tree-Structured Item Identifiers via Contrastive Lea… ☆26 · Dec 12, 2024 · Updated last year
- A unified robotic manipulation learning framework ☆22 · Sep 4, 2025 · Updated 8 months ago
- Code, Data and Model for Paper "Learning from Peers in Reasoning Models" ☆26 · May 13, 2025 · Updated 11 months ago
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games" ☆25 · Apr 26, 2026 · Updated last week
- ☆15 · Mar 20, 2023 · Updated 3 years ago
- Material parsers and other tools and scripts, initially developed for Grobid Superconductor ☆13 · Feb 21, 2025 · Updated last year
- AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024) ☆19 · Aug 9, 2024 · Updated last year
- An online federated reinforcement learning algorithm published in INFOCOM2024 ☆17 · Dec 1, 2024 · Updated last year
- TensorFlow Tutorial and Examples for Beginners with Latest APIs ☆23 · Jan 21, 2019 · Updated 7 years ago
- Manuscript paper (genkō yōshi) and Japanese-style letter paper; vertical writing with UPTEX/UPLATEX ☆10 · Nov 27, 2019 · Updated 6 years ago
- ☆39 · Jan 19, 2026 · Updated 3 months ago
- [ACL 2025] Adaptive Retrieval without Self-Knowledge? Bringing Uncertainty Back Home ☆18 · May 17, 2025 · Updated 11 months ago
- [TPAMI 2025] Revisiting Essential and Non-Essential Settings of Evidential Deep Learning ☆26 · Jun 24, 2025 · Updated 10 months ago
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning" ☆29 · Jun 3, 2025 · Updated 11 months ago
- ☆19 · Jul 7, 2025 · Updated 10 months ago
- The light codes for the paper published in JMS named 'Solving task scheduling problems in cloud manufacturing via attention mechanism and… ☆20 · May 15, 2023 · Updated 2 years ago
- [AAAI 2026 Oral] Automatic Multi-agent Communication Topology Design ☆44 · Jan 30, 2026 · Updated 3 months ago
- [EMNLP 2025] Verification Engineering for RL in Instruction Following ☆54 · Mar 30, 2026 · Updated last month
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/ ☆13 · Jul 1, 2024 · Updated last year
- ☆15 · Nov 19, 2021 · Updated 4 years ago
- ☆19 · Mar 13, 2023 · Updated 3 years ago
- ☆28 · May 27, 2024 · Updated last year
- Enemies for your LLM ☆35 · Jan 20, 2026 · Updated 3 months ago
- ☆23 · Jan 16, 2025 · Updated last year