Ruiyang-061X / Awesome-Search-RLLinks
☆42Updated 4 months ago
Alternatives and similar repositories for Awesome-Search-RL
Users that are interested in Awesome-Search-RL are comparing it to the libraries listed below
Sorting:
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆66Updated 5 months ago
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆146Updated 5 months ago
- This is a survey of research on AI scientists, AI researchers, AI engineers, and a series of AI-driven research studies☆145Updated last week
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆60Updated 3 months ago
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents ". We will keep adding papers and improving the…☆162Updated 3 months ago
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models☆119Updated 2 weeks ago
- A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.☆40Updated 3 months ago
- Awesome LLM pre-training resources, including data, frameworks, and methods.☆269Updated 6 months ago
- ☆102Updated last year
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆89Updated 5 months ago
- diagnosis_zero, R1 Zero reproduce on disease diagnosis☆33Updated 3 months ago
- Reading List of Memory Augmented Multimodal Research, including multimodal context modeling, memory in vision and robotics, and external …☆45Updated last year
- ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations [COLM 2025]☆238Updated 3 months ago
- 💡 Awesome RAG: An up-to-date list of Retrieval-Augmented Generation (RAG) for LLMs, focusing on the development of technology.☆326Updated 2 weeks ago
- MPO: Boosting LLM Agents with Meta Plan Optimization (EMNLP 2025 Findings)☆73Updated 2 months ago
- ☆50Updated 7 months ago
- 珠算代码大模型(Abacus Code LLM)☆56Updated last year
- ☆99Updated 2 weeks ago
- Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors (ACL Findings 2025)☆84Updated 4 months ago
- The demo, code and data of FollowRAG☆75Updated 4 months ago
- Awesome Deep Research list! For more details, please refer to our survey paper -- A Comprehensive Survey of Deep Research: Systems, Metho…☆345Updated last week
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆279Updated 2 weeks ago
- [NeurIPS 2024] Personal Agentic AI for MultiAgent Cooperation☆87Updated 10 months ago
- Open replication of DeepSeek R1 for text-to-graph extraction.☆99Updated 8 months ago
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆96Updated 3 weeks ago
- ☆154Updated 3 weeks ago
- A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆57Updated 4 months ago
- A Comprehensive Library for Memory of LLM-based Agents.☆86Updated 5 months ago
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆190Updated 3 weeks ago
- ☆68Updated this week