Ruiyang-061X / Awesome-Search-RL
☆45Updated 7 months ago
Alternatives and similar repositories for Awesome-Search-RL
Users that are interested in Awesome-Search-RL are comparing it to the libraries listed below
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆68Updated 8 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆63Updated 6 months ago
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆147Updated 7 months ago
- This is a survey of research on AI scientists, AI researchers, AI engineers, and a series of AI-driven research studies☆168Updated 2 months ago
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models☆122Updated 3 months ago
- ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization☆95Updated 7 months ago
- A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.☆45Updated last month
- ☆53Updated 10 months ago
- [TMLR 2025] Reading List of Memory Augmented Multimodal Research, including multimodal context modeling, memory in vision and robotics, a…☆52Updated this week
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents". We will keep adding papers and improving the…☆182Updated 6 months ago
- ☆105Updated last year
- diagnosis_zero: an R1-Zero reproduction on disease diagnosis☆34Updated 5 months ago
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆229Updated 3 months ago
- ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations [COLM 2025]☆249Updated 6 months ago
- Abacus Code LLM (珠算代码大模型)☆58Updated last year
- A Systematic Survey of Deep Research☆271Updated 2 weeks ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆303Updated 3 months ago
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆111Updated 3 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆113Updated 3 weeks ago
- This repo aims to record resources on role-playing abilities in LLMs, including datasets, papers, applications, etc.☆137Updated last year
- The demo, code and data of FollowRAG☆75Updated 6 months ago
- ☆104Updated last year
- AgenTracer: A Lightweight Failure Attributor for Agentic Systems☆67Updated 2 months ago
- [NeurIPS 2024] Personal Agentic AI for MultiAgent Cooperation☆87Updated last year
- MCP DeepResearch Server: a deep research server built on LangGraph + Ollama + Tavily, supporting asynchronous execution, timeout control, and progress push notifications☆32Updated 7 months ago
- A comprehensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆60Updated 7 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆40Updated 5 months ago
- [ICML 2025] ResearchTown: Simulator of Human Research Community☆191Updated this week
- SSRL: Self-Search Reinforcement Learning☆204Updated 5 months ago
- MrlX: A Multi-Agent Reinforcement Learning Framework☆161Updated last month