wizardlancet / diagnosis_zero
diagnosis_zero, R1 Zero reproduce on disease diagnosis
☆29Updated 2 months ago
Alternatives and similar repositories for diagnosis_zero:
Users that are interested in diagnosis_zero are comparing it to the libraries listed below
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆64Updated 2 months ago
- Knowledge-Reasoning Synergy Reinforcement Learning.☆34Updated last month
- [NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine☆81Updated 6 months ago
- ☆47Updated 4 months ago
- ☆53Updated 5 months ago
- An Open Math Pre-trainng Dataset with 370B Tokens.☆72Updated 3 weeks ago
- ☆101Updated 4 months ago
- o1 Chain of Thought Examples☆33Updated 6 months ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆86Updated last month
- ☆40Updated last month
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models☆101Updated 3 weeks ago
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆44Updated 2 months ago
- ☆56Updated 4 months ago
- This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts".☆31Updated last month
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆75Updated 4 months ago
- ☆48Updated 2 months ago
- Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning☆38Updated 2 months ago
- ☆62Updated 3 weeks ago
- Tina: Tiny Reasoning Models via LoRA☆55Updated this week
- 珠算代码大模型(Abacus Code LLM)☆56Updated 7 months ago
- From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation☆88Updated last month
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆32Updated last month
- ☆46Updated last month
- ☆82Updated last year
- ☆56Updated 5 months ago
- Computer Agent Arena: Test & compare AI agents in real desktop apps & web environments. Code/data coming soon!☆40Updated 2 weeks ago
- ☆43Updated last month
- MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension☆43Updated 4 months ago
- ☆55Updated 2 weeks ago
- ☆62Updated 2 months ago