[ICLR 2026] Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents
☆42Mar 7, 2026Updated 3 weeks ago
Alternatives and similar repositories for IGPO
Users that are interested in IGPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents☆49Feb 2, 2026Updated last month
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling☆22Dec 16, 2024Updated last year
- ☆21Feb 15, 2024Updated 2 years ago
- The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed …☆11Sep 27, 2024Updated last year
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆28Aug 9, 2025Updated 7 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆21Dec 14, 2024Updated last year
- Released code for「Target-to-Source Augmentation for Aspect Sentiment Triplet Extraction」in EMNLP2023.☆13Mar 28, 2024Updated last year
- MetaAgent: Toward Self-Evolving Agent via Tool Meta-Learning☆44Sep 3, 2025Updated 6 months ago
- [ICLR 2025] Large (Vision) Language Models are Unsupervised In-Context Learners☆22Jun 6, 2025Updated 9 months ago
- Code for Paper ACL'25: FiDELIS: Faithful Reasoning of Large Language Model on Knowledge Graph Question Answering☆18May 8, 2025Updated 10 months ago
- [AAAI 2026 Oral] Automatic Multi-agent Communication Topology Design☆35Jan 30, 2026Updated last month
- ☆30Aug 21, 2025Updated 7 months ago
- This is the official implementation of the paper "Generative Retrieval with Semantic Tree-Structured Item Identifiers via Contrastive Lea…☆25Dec 12, 2024Updated last year
- A unified robotic manipulation learning framework☆21Sep 4, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- extension for fabric to handle prompts through pexpect☆44May 31, 2015Updated 10 years ago
- Code, Data and Model for Paper "Learning from Peers in Reasoning Models"☆27May 13, 2025Updated 10 months ago
- ☆15Mar 20, 2023Updated 3 years ago
- [SIGGRAPH Asia 2025] CHARM: Control-point-based 3D Anime Hairstyle Auto-Regressive Modeling☆47Sep 26, 2025Updated 6 months ago
- [TPAMI 2025] Revisiting Essential and Non-Essential Settings of Evidential Deep Learning☆25Jun 24, 2025Updated 9 months ago
- papers about recommender system.☆10May 18, 2021Updated 4 years ago
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆29Jun 3, 2025Updated 9 months ago
- ☆16Feb 10, 2023Updated 3 years ago
- [EMNLP 2025] Verification Engineering for RL in Instruction Following☆53Jan 5, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆28Feb 8, 2026Updated last month
- ☆15Nov 19, 2021Updated 4 years ago
- ☆28May 27, 2024Updated last year
- ☆21Jan 16, 2025Updated last year
- A curated collection of research and techniques for protecting intellectual property of large language models, including watermarking, fi…☆47Feb 15, 2026Updated last month
- ☆10Mar 24, 2023Updated 3 years ago
- A Comprehensive Dataset for Advanced Image Generation and Editing}☆31Oct 2, 2025Updated 5 months ago
- Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023)☆28Dec 8, 2023Updated 2 years ago
- The guideline for pod.☆10Jun 19, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- 基于Model Context Protocol (MCP)的ComfyUI图像生成服务,通过API调用本地ComfyUI实例生成图片,实现自然语言生图自由☆23Nov 30, 2025Updated 3 months ago
- 一种尝试解决情绪分类任务中的不平衡问题的分类方法研究。☆10May 5, 2017Updated 8 years ago
- ☆28Aug 13, 2025Updated 7 months ago
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆292Oct 2, 2025Updated 5 months ago
- ☆10Jul 5, 2023Updated 2 years ago
- Sentiment Lexicon Construction☆10Sep 17, 2019Updated 6 years ago
- This repository tests various recurrent neural network architectures on baseline datasets SeqMNIST and pMNIST.☆23Oct 2, 2018Updated 7 years ago