Ruiyang-061X / Awesome-Search-RLLinks
☆39Updated 2 months ago
Alternatives and similar repositories for Awesome-Search-RL
Users that are interested in Awesome-Search-RL are comparing it to the libraries listed below
Sorting:
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆62Updated 3 months ago
- This is a survey of research on AI scientists, AI researchers, AI engineers, and a series of AI-driven research studies☆114Updated this week
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆53Updated last month
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models☆114Updated 2 months ago
- A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.☆36Updated last month
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆140Updated 3 months ago
- ☆100Updated last year
- Awesome-Large-Search-Models is a collection of papers and resources (Methods, Datasets and other resources) about awesome agentic search …☆118Updated last week
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents ". We will keep adding papers and improving the…☆145Updated last month
- ☆50Updated 5 months ago
- Awesome LLM pre-training resources, including data, frameworks, and methods.☆230Updated 3 months ago
- diagnosis_zero, R1 Zero reproduce on disease diagnosis☆30Updated last month
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆82Updated 3 months ago
- Open replication of DeepSeek R1 for text-to-graph extraction.☆98Updated 6 months ago
- A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆53Updated 2 months ago
- ☆103Updated 8 months ago
- MPO: Boosting LLM Agents with Meta Plan Optimization (EMNLP 2025 Findings)☆66Updated last week
- ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations [COLM 2025]☆228Updated last month
- 珠算代码大模型(Abacus Code LLM)☆55Updated 11 months ago
- A Comprehensive Library for Memory of LLM-based Agents.☆69Updated 3 months ago
- Awesome Deep Research list☆292Updated 2 months ago
- The official GitHub page for the survey paper "A Survey on Data Augmentation in Large Model Era"☆128Updated last year
- connecting humans and agents☆88Updated 8 months ago
- MiroThinker is open-source agentic models trained for deep research and complex tool use scenarios.☆251Updated this week
- Reading List of Memory Augmented Multimodal Research, including multimodal context modeling, memory in vision and robotics, and external …☆43Updated 11 months ago
- An Awesome List of Agentic Model trained with Reinforcement Learning☆370Updated last week
- [COLM 2025] An Open Math Pre-trainng Dataset with 370B Tokens.☆98Updated 4 months ago
- ☆55Updated 9 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆58Updated 10 months ago
- [Up-to-date] Awesome Agentic Deep Research Resources☆407Updated last month