wizardlancet / diagnosis_zeroLinks

diagnosis_zero, R1 Zero reproduce on disease diagnosis

☆30

Alternatives and similar repositories for diagnosis_zero

Users that are interested in diagnosis_zero are comparing it to the libraries listed below

Sorting:

hzy312 / knowledge-r1
IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent
☆60Updated 2 months ago
ResearAI / Awesome-AI-Scientist
This is a survey of research on AI scientists, AI researchers, AI engineers, and a series of AI-driven research studies
☆77Updated 2 months ago
Gen-Verse / ScoreFlow
Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"
☆79Updated last month
RUC-NLPIR / HiRA
The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search
☆45Updated last week
Aegis1863 / LLMs-Distillation-Quantification
Repo of ACL 2025 main Paper "Quantification of Large Language Model Distillation"
☆88Updated last month
Open-Source-O1 / o1_Reasoning_Patterns_Study
☆102Updated 7 months ago
LLM360 / MegaMath
[COLM 2025] An Open Math Pre-trainng Dataset with 370B Tokens.
☆95Updated 3 months ago
AkariAsai / ScholarQABench
This repository contains ScholarQABench data and evaluation pipeline.
☆73Updated 3 months ago
Ruiyang-061X / Awesome-Search-RL
☆38Updated last month
google-deepmind / llms_can_learn_rules
☆56Updated 7 months ago
NEUIR / M2RAG
This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts".
☆37Updated 4 months ago
Bui1dMySea / MemLong
☆94Updated 7 months ago
du-nlp-lab / MLR-Copilot
☆66Updated 3 months ago
shizhediao / Post-Training-Data-Flywheel
We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.
☆57Updated 9 months ago
yale-nlp / MCTS-RAG
☆57Updated 2 weeks ago
OPPO-PersonalAI / OAgents
Implementation for OAgents: An Empirical Study of Building Effective Agents
☆82Updated this week
THU-KEG / Agentic-Reward-Modeling
[ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
☆96Updated last month
DevoAllen / Awesome-Reasoning-Economy-Papers
Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models
☆110Updated last month
LightChen233 / Awesome-LLM-for-NLP
☆99Updated last year
thu-coai / SPaR
☆47Updated last month
dongxiangjue / Awesome-LLM-Self-Improvement
A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …
☆85Updated 6 months ago
multimodal-art-projection / DailyPaper
☆54Updated 8 months ago
THUDM / ChatGLM-Math
☆82Updated last year
hellangleZ / Qwen3_autothink_adapter
Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…
☆21Updated 2 months ago
zhaochenyang20 / Prompt2Model-Self-Guide
SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper
☆33Updated last year
HIT-SCIR / Abacus
珠算代码大模型（Abacus Code LLM）
☆55Updated 9 months ago
SivilTaram / code-html-to-markdown
A lightweight script for processing HTML page to markdown format with support for code blocks
☆79Updated last year
gersteinlab / ChemAgent
[ICLR 2025]ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning
☆62Updated 4 months ago
FreedomIntelligence / FastLLM
Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];
☆40Updated last year
McGill-NLP / agent-reward-bench
AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories
☆27Updated this week