The official repository of "A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications".
☆229May 11, 2026Updated last week
Alternatives and similar repositories for Awesome-RL-based-Agentic-Search-Papers
Users that are interested in Awesome-RL-based-Agentic-Search-Papers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆80Mar 5, 2026Updated 2 months ago
- Dynamic human image animation with strong identity preservation, heterogeneous character driving, and controllable backgrounds.☆142May 23, 2025Updated last year
- 🧬 Python code that implements the active finite Voronoi (AFV) model.☆22May 13, 2026Updated last week
- UR2: Unify RAG and Reasoning through Reinforcement Learning☆129Apr 26, 2026Updated 3 weeks ago
- The official code of "CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval"☆15Sep 19, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- INFTY Engine: An Optimization Toolkit to Support Continual AI☆570Sep 13, 2025Updated 8 months ago
- official implementation for paper titled "Training-free Horizon Extension for Autoregressive Video Generation"☆112Feb 17, 2026Updated 3 months ago
- [ICML'26 Spotlight] Beyond log-likelihood: exploring alternative objectives for supervised fine-tuning of language model post-training☆60Oct 4, 2025Updated 7 months ago
- ☆153Jan 2, 2024Updated 2 years ago
- [NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems☆165Nov 3, 2025Updated 6 months ago
- ☆58Feb 24, 2026Updated 2 months ago
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral]☆45Aug 25, 2025Updated 8 months ago
- ☆233Nov 5, 2025Updated 6 months ago
- ☆16Updated this week
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆418May 13, 2026Updated last week
- ☆43Jul 28, 2025Updated 9 months ago
- CPG-SPMT: Control-oriented Parameter-Grouped Single Particle Model with Thermal effects☆79Apr 22, 2026Updated last month
- An adaptive sampling framework for Reinforce-style LLM post training.☆96Nov 29, 2025Updated 5 months ago
- [NeurIPS 2025] Search and Refine During Think: Facilitating Knowledge Refinement for Improved Retrieval-Augmented Reasoning☆136Dec 13, 2025Updated 5 months ago
- ☆28Mar 10, 2026Updated 2 months ago
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆19Nov 22, 2025Updated 6 months ago
- This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.☆17Oct 20, 2025Updated 7 months ago
- ☆20Jun 10, 2025Updated 11 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ACL 2026] "LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"☆606Apr 7, 2026Updated last month
- Gotta Hear Them All: Towards Sound Source Aware Audio Generation.☆69Nov 15, 2025Updated 6 months ago
- ☆17Jun 10, 2025Updated 11 months ago
- ☆120Aug 29, 2025Updated 8 months ago
- [EMNLP 2022] TaCube: Pre-computing Data Cubes for Answering Numerical-Reasoning Questions over Tabular Data☆17May 17, 2023Updated 3 years ago
- The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"☆22Jun 26, 2025Updated 10 months ago
- The enhanced model is specially trained for aquatic targets, achieving higher accuracy. It can detect sailboats, humans, other vessels, b…☆47May 15, 2025Updated last year
- OmniNWM: Omniscient Navigation World Models for Autonomous Driving☆336Apr 7, 2026Updated last month
- The dataset, code, and model for our paper "Reflection Generation for Composite Image Using Diffusion Model", ICME, 2026.☆57Apr 4, 2026Updated last month
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆30May 24, 2025Updated 11 months ago
- [WIP] A key value separation time series data storage engine inspired by Wisckey and TSM.☆15May 27, 2025Updated 11 months ago
- [EMNLP2025]Official implementation: Agent-style vision question answer in Autonomous Driving!☆143Sep 27, 2025Updated 7 months ago
- RepoMaster: The open-source AI agent that masters GitHub. It turns any code repository into a powerful tool, achieving a new level of aut…☆522Nov 5, 2025Updated 6 months ago
- ☆40Apr 21, 2025Updated last year
- [ICLR'26] MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs☆48Apr 17, 2026Updated last month
- ☆36Feb 21, 2025Updated last year