wizardlancet / diagnosis_zeroLinks
diagnosis_zero, R1 Zero reproduce on disease diagnosis
☆29Updated 4 months ago
Alternatives and similar repositories for diagnosis_zero
Users that are interested in diagnosis_zero are comparing it to the libraries listed below
Sorting:
- ☆54Updated 7 months ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆58Updated last month
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆78Updated last month
- This is a survey of research on AI scientists, AI researchers, AI engineers, and a series of AI-driven research studies☆71Updated last month
- An Open Math Pre-trainng Dataset with 370B Tokens.☆89Updated 2 months ago
- ☆37Updated 2 weeks ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆20Updated last month
- ☆41Updated this week
- An Awesome List of Reinforcement Learning-based Large Language Agent Works. Collect directly from official code base.☆154Updated this week
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆93Updated 2 weeks ago
- ☆103Updated 6 months ago
- DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking☆47Updated 3 months ago
- ☆49Updated 3 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆57Updated 8 months ago
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆15Updated 4 months ago
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models☆108Updated 2 weeks ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆80Updated 6 months ago
- [NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine☆89Updated 9 months ago
- ☆43Updated 3 months ago
- ☆33Updated this week
- ☆56Updated 6 months ago
- Computer Agent Arena: Test & compare AI agents in real desktop apps & web environments. Code/data coming soon!☆45Updated 2 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆39Updated last year
- [ICLR 2025]ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning☆57Updated 3 months ago
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆135Updated last week
- ☆47Updated 2 weeks ago
- o1 Chain of Thought Examples☆33Updated 8 months ago
- This repository contains ScholarQABench data and evaluation pipeline.☆72Updated 2 months ago
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation☆22Updated 3 weeks ago
- Source code for GreaTer ICLR 2025 - Gradient Over Reasoning makes Smaller Language Models Strong Prompt Optimizers☆29Updated 2 months ago