The official repository of "A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications".
☆266Jun 22, 2026Updated last week
Alternatives and similar repositories for Awesome-RL-based-Agentic-Search-Papers
Users that are interested in Awesome-RL-based-Agentic-Search-Papers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆80Mar 5, 2026Updated 3 months ago
- ☆12Mar 31, 2025Updated last year
- [TMM'26] Dynamic human image animation with strong identity preservation, heterogeneous character driving, and controllable backgrounds.☆142May 23, 2025Updated last year
- 🧬 Python code that implements the active finite Voronoi (AFV) model.☆22Jun 12, 2026Updated 2 weeks ago
- UR2: Unify RAG and Reasoning through Reinforcement Learning☆130May 26, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The official code of "CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval"☆15Sep 19, 2024Updated last year
- [ICML'26 Spotlight] What is the right loss function for LLM supervised finetuning?☆64May 28, 2026Updated last month
- ☆153Jan 2, 2024Updated 2 years ago
- [NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems☆178Nov 3, 2025Updated 7 months ago
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral]☆46Aug 25, 2025Updated 10 months ago
- ☆16May 18, 2026Updated last month
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆435Jun 2, 2026Updated last month
- ☆44Jul 28, 2025Updated 11 months ago
- AI-powered tool for analyzing GitHub trending repositories and URL metadata☆27Jun 7, 2026Updated 3 weeks ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- An adaptive sampling framework for Reinforce-style LLM post training.☆96Nov 29, 2025Updated 7 months ago
- ☆29Mar 10, 2026Updated 3 months ago
- ☆15Jan 27, 2026Updated 5 months ago
- [NeurIPS 2025] Search and Refine During Think: Facilitating Knowledge Refinement for Improved Retrieval-Augmented Reasoning☆140Jun 25, 2026Updated last week
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆20Nov 22, 2025Updated 7 months ago
- This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.☆17Oct 20, 2025Updated 8 months ago
- ☆20Jun 10, 2025Updated last year
- [ACL 2026 Oral] "LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"☆600May 22, 2026Updated last month
- Official Repo of "RobustFlow: Towards Robust Agentic Workflow Generation"☆238Oct 19, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆17Jun 10, 2025Updated last year
- [ACM MM'2024] Official repository for "Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval"☆43Dec 23, 2024Updated last year
- Heterogeneous Containerization of Agents☆112Jul 29, 2025Updated 11 months ago
- ☆122Aug 29, 2025Updated 10 months ago
- [EMNLP 2022] TaCube: Pre-computing Data Cubes for Answering Numerical-Reasoning Questions over Tabular Data☆17May 17, 2023Updated 3 years ago
- The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"☆22Jun 26, 2025Updated last year
- ☆14Jul 26, 2017Updated 8 years ago
- The dataset, code, and model for our paper "Reflection Generation for Composite Image Using Diffusion Model", ICME, 2026.☆58Apr 4, 2026Updated 2 months ago
- Programmable chat templates for LLM training and inference.☆121Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- RAG methods, benchmarks, and toolkits☆19Nov 28, 2024Updated last year
- ☆30May 24, 2025Updated last year
- [WIP] A key value separation time series data storage engine inspired by Wisckey and TSM.☆15May 27, 2025Updated last year
- ☆40Apr 21, 2025Updated last year
- RepoMaster: The open-source AI agent that masters GitHub. It turns any code repository into a powerful tool, achieving a new level of aut…☆531Nov 5, 2025Updated 7 months ago
- Least Squares Regression for subspace clustering☆11May 27, 2018Updated 8 years ago
- 🚀 轻量视频🎥 大模型🤖☆23Apr 27, 2025Updated last year