The official repository of "A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications".
☆223Apr 25, 2026Updated last week
Alternatives and similar repositories for Awesome-RL-based-Agentic-Search-Papers
Users that are interested in Awesome-RL-based-Agentic-Search-Papers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An official PyTorch implementation of "Certifiably Robust Graph Contrastive Learning" (NeurIPS 2023)☆11Jan 22, 2024Updated 2 years ago
- ☆11Mar 31, 2025Updated last year
- 🧬 Python code that implements the active finite Voronoi (AFV) model.☆21Apr 24, 2026Updated last week
- UR2: Unify RAG and Reasoning through Reinforcement Learning☆128Apr 26, 2026Updated last week
- The official code of "CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval"☆15Sep 19, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- INFTY Engine: An Optimization Toolkit to Support Continual AI☆568Sep 13, 2025Updated 7 months ago
- official implementation for paper titled "Training-free Horizon Extension for Autoregressive Video Generation"☆112Feb 17, 2026Updated 2 months ago
- Code, notebooks, and other material for FuturePath AI's training course on Generative AI☆12Apr 24, 2025Updated last year
- ☆153Jan 2, 2024Updated 2 years ago
- ☆57Feb 24, 2026Updated 2 months ago
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral]☆45Aug 25, 2025Updated 8 months ago
- ☆232Nov 5, 2025Updated 5 months ago
- ☆16Sep 17, 2024Updated last year
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆409Apr 16, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆43Jul 28, 2025Updated 9 months ago
- ☆25Jun 27, 2022Updated 3 years ago
- AI-powered tool for analyzing GitHub trending repositories and URL metadata☆25Apr 1, 2026Updated last month
- An adaptive sampling framework for Reinforce-style LLM post training.☆95Nov 29, 2025Updated 5 months ago
- [NeurIPS 2025] Search and Refine During Think: Facilitating Knowledge Refinement for Improved Retrieval-Augmented Reasoning☆135Dec 13, 2025Updated 4 months ago
- ☆28Mar 10, 2026Updated last month
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆19Nov 22, 2025Updated 5 months ago
- This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.☆16Oct 20, 2025Updated 6 months ago
- [ACL 2026] "LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"☆599Apr 7, 2026Updated 3 weeks ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆17Jun 10, 2025Updated 10 months ago
- ☆10Jul 19, 2021Updated 4 years ago
- ☆119Aug 29, 2025Updated 8 months ago
- [EMNLP 2022] TaCube: Pre-computing Data Cubes for Answering Numerical-Reasoning Questions over Tabular Data☆17May 17, 2023Updated 2 years ago
- The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"☆22Jun 26, 2025Updated 10 months ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆24Sep 21, 2025Updated 7 months ago
- The dataset, code, and model for our paper "Reflection Generation for Composite Image Using Diffusion Model", ICME, 2026.☆57Apr 4, 2026Updated 3 weeks ago
- RAG methods, benchmarks, and toolkits☆19Nov 28, 2024Updated last year
- ☆29May 24, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [EMNLP2025]Official implementation: Agent-style vision question answer in Autonomous Driving!☆143Sep 27, 2025Updated 7 months ago
- RepoMaster: The open-source AI agent that masters GitHub. It turns any code repository into a powerful tool, achieving a new level of aut…☆520Nov 5, 2025Updated 5 months ago
- MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs☆45Apr 17, 2026Updated 2 weeks ago
- Least Squares Regression for subspace clustering☆11May 27, 2018Updated 7 years ago
- ☆36Feb 21, 2025Updated last year
- Code implementation for paper AbsenceBench: Language Models Can't Tell What's Missing☆19Oct 23, 2025Updated 6 months ago
- HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches☆37Oct 9, 2025Updated 6 months ago