wizardlancet / diagnosis_zeroLinks
diagnosis_zero, R1 Zero reproduce on disease diagnosis
☆30Updated 5 months ago
Alternatives and similar repositories for diagnosis_zero
Users that are interested in diagnosis_zero are comparing it to the libraries listed below
Sorting:
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆60Updated 2 months ago
- This is a survey of research on AI scientists, AI researchers, AI engineers, and a series of AI-driven research studies☆77Updated 2 months ago
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆79Updated last month
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆45Updated last week
- Repo of ACL 2025 main Paper "Quantification of Large Language Model Distillation"☆88Updated last month
- ☆102Updated 7 months ago
- [COLM 2025] An Open Math Pre-trainng Dataset with 370B Tokens.☆95Updated 3 months ago
- This repository contains ScholarQABench data and evaluation pipeline.☆73Updated 3 months ago
- ☆38Updated last month
- ☆56Updated 7 months ago
- This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts".☆37Updated 4 months ago
- ☆94Updated 7 months ago
- ☆66Updated 3 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆57Updated 9 months ago
- ☆57Updated 2 weeks ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆82Updated this week
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆96Updated last month
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models☆110Updated last month
- ☆99Updated last year
- ☆47Updated last month
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆85Updated 6 months ago
- ☆54Updated 8 months ago
- ☆82Updated last year
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆21Updated 2 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆33Updated last year
- 珠算代码大模型(Abacus Code LLM)☆55Updated 9 months ago
- A lightweight script for processing HTML page to markdown format with support for code blocks☆79Updated last year
- [ICLR 2025]ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning☆62Updated 4 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Updated last year
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆27Updated this week