wizardlancet / diagnosis_zeroLinks
diagnosis_zero, R1 Zero reproduce on disease diagnosis
☆34Updated 5 months ago
Alternatives and similar repositories for diagnosis_zero
Users that are interested in diagnosis_zero are comparing it to the libraries listed below
Sorting:
- ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization☆95Updated 7 months ago
- This is a survey of research on AI scientists, AI researchers, AI engineers, and a series of AI-driven research studies☆168Updated 2 months ago
- ☆104Updated last year
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆68Updated 8 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆63Updated 6 months ago
- [NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine☆94Updated last year
- ☆45Updated 7 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆124Updated 7 months ago
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models☆122Updated 3 months ago
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆147Updated 7 months ago
- Data and Code for EMNLP 2025 Findings Paper "MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search"☆84Updated 2 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆113Updated 3 weeks ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆61Updated last year
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆111Updated 3 months ago
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆229Updated 3 months ago
- ☆96Updated last year
- Repo of ACL 2025 Paper "Quantification of Large Language Model Distillation"☆93Updated 5 months ago
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆45Updated 11 months ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22Updated 8 months ago
- ☆53Updated 10 months ago
- ☆54Updated last year
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆303Updated 3 months ago
- ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations [COLM 2025]☆249Updated 6 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆19Updated last month
- ☆46Updated 7 months ago
- Open replication of DeepSeek R1 for text-to-graph extraction.☆99Updated 11 months ago
- MrlX: A Multi-Agent Reinforcement Learning Framework☆161Updated last month
- This repository contains ScholarQABench data and evaluation pipeline.☆93Updated 5 months ago
- a curated list of the role of small models in the LLM era☆111Updated last year