Ruiyang-061X / Awesome-Search-RLLinks
☆44Updated 6 months ago
Alternatives and similar repositories for Awesome-Search-RL
Users that are interested in Awesome-Search-RL are comparing it to the libraries listed below
Sorting:
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆67Updated 6 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆63Updated 5 months ago
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆146Updated 6 months ago
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models☆120Updated last month
- This is a survey of research on AI scientists, AI researchers, AI engineers, and a series of AI-driven research studies☆163Updated last month
- diagnosis_zero, R1 Zero reproduce on disease diagnosis☆34Updated 4 months ago
- Reading List of Memory Augmented Multimodal Research, including multimodal context modeling, memory in vision and robotics, and external …☆50Updated last year
- ☆102Updated last year
- Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors (ACL Findings 2025)☆86Updated 6 months ago
- ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations [COLM 2025]☆244Updated 5 months ago
- ☆55Updated last year
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents ". We will keep adding papers and improving the…☆174Updated 5 months ago
- ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization☆93Updated 6 months ago
- 珠算代码大模型(Abacus Code LLM)☆57Updated last year
- [NeurIPS 2024] Personal Agentic AI for MultiAgent Cooperation☆87Updated last year
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆211Updated 2 months ago
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆103Updated 2 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆291Updated last month
- MPO: Boosting LLM Agents with Meta Plan Optimization (EMNLP 2025 Findings)☆73Updated 3 months ago
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral]☆39Updated 3 months ago
- MrlX: A Multi-Agent Reinforcement Learning Framework☆152Updated 2 weeks ago
- SSRL: Self-Search Reinforcement Learning☆158Updated 3 months ago
- ☆52Updated 9 months ago
- ☆185Updated this week
- The demo, code and data of FollowRAG☆75Updated 5 months ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22Updated 7 months ago
- Deep Reasoning Translation (DRT) Project☆239Updated 3 months ago
- Open replication of DeepSeek R1 for text-to-graph extraction.☆99Updated 10 months ago
- SkillWeaver is a framework to enable web agent self-improvement through environment exploration and skill synthesis.☆100Updated 7 months ago
- Data Synthesis for Deep Research Based on Semi-Structured Data☆183Updated last month