[ICLR 2026] Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents
☆77Apr 23, 2026Updated last month
Alternatives and similar repositories for IGPO
Users that are interested in IGPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents☆50Feb 2, 2026Updated 3 months ago
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling☆22Dec 16, 2024Updated last year
- Repo. for RLCF.☆15Apr 1, 2024Updated 2 years ago
- 🌟Official code of our AAAI26 paper 🔍WebFilter☆39Nov 9, 2025Updated 6 months ago
- The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed …☆11Sep 27, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆30Aug 9, 2025Updated 9 months ago
- ☆21Dec 14, 2024Updated last year
- Released code for「Target-to-Source Augmentation for Aspect Sentiment Triplet Extraction」in EMNLP2023.☆13Mar 28, 2024Updated 2 years ago
- Official Code Release for "Training a Generally Curious Agent"☆47May 18, 2025Updated last year
- [ICLR 2025] Large (Vision) Language Models are Unsupervised In-Context Learners☆22Jun 6, 2025Updated 11 months ago
- MetaAgent: Toward Self-Evolving Agent via Tool Meta-Learning☆45Sep 3, 2025Updated 8 months ago
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆65Sep 24, 2024Updated last year
- ☆33Aug 21, 2025Updated 9 months ago
- extension for fabric to handle prompts through pexpect☆44May 31, 2015Updated 10 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆25Apr 26, 2026Updated last month
- ☆15Mar 20, 2023Updated 3 years ago
- [SIGGRAPH Asia 2025] CHARM: Control-point-based 3D Anime Hairstyle Auto-Regressive Modeling☆49Apr 17, 2026Updated last month
- SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts☆65Dec 1, 2025Updated 5 months ago
- Material parsers and other tools, scripts Initially developed for Grobid Superconductor☆13Feb 21, 2025Updated last year
- TensorFlow Tutorial and Examples for Beginners with Latest APIs☆23Jan 21, 2019Updated 7 years ago
- ☆39Jan 19, 2026Updated 4 months ago
- Short RL☆18Apr 16, 2026Updated last month
- [TPAMI 2025] Revisiting Essential and Non-Essential Settings of Evidential Deep Learning☆26Jun 24, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆29Jun 3, 2025Updated 11 months ago
- ☆19Jul 7, 2025Updated 10 months ago
- [EMNLP 2025] Verification Engineering for RL in Instruction Following☆55Mar 30, 2026Updated last month
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/☆13Jul 1, 2024Updated last year
- the open-source code of QAgent☆58Oct 14, 2025Updated 7 months ago
- ☆15Nov 19, 2021Updated 4 years ago
- ☆23Jan 16, 2025Updated last year
- A curated collection of research and techniques for protecting intellectual property of large language models, including watermarking, fi…☆47Feb 15, 2026Updated 3 months ago
- ☆10Mar 24, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for the paper "Studying Large Language Model Behaviors Under Context-Memory Conflicts With Real Documentss"☆15Oct 8, 2024Updated last year
- Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023)☆28Dec 8, 2023Updated 2 years ago
- A Comprehensive Dataset for Advanced Image Generation and Editing}☆32Oct 2, 2025Updated 7 months ago
- The guideline for pod.☆10Jun 19, 2020Updated 5 years ago
- Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language Models☆33Oct 6, 2025Updated 7 months ago
- ☆31Feb 8, 2026Updated 3 months ago
- 一种尝试解决情绪分类任务中的不平衡问题的分类方法研究。☆10May 5, 2017Updated 9 years ago