Step-DeepResearch
☆514 Feb 2, 2026 Updated last month
Alternatives and similar repositories for StepDeepResearch
Users who are interested in StepDeepResearch are comparing it to the repositories listed below
- ☆25 Jun 10, 2025 Updated 8 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models ☆159 Jun 26, 2025 Updated 8 months ago
- PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning ☆328 Feb 5, 2026 Updated 3 weeks ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution ☆69 Dec 8, 2025 Updated 2 months ago
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning". ☆31 Updated this week
- Data Synthesis for Deep Research Based on Semi-Structured Data ☆199 Dec 18, 2025 Updated 2 months ago
- Marketplace ML experiment - training without backprop ☆27 Sep 9, 2025 Updated 5 months ago
- ☆46 Jun 11, 2025 Updated 8 months ago
- ☆78 Jun 20, 2025 Updated 8 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling ☆42 Dec 29, 2025 Updated 2 months ago
- ☆33 Jul 15, 2025 Updated 7 months ago
- ☆35 Jul 19, 2025 Updated 7 months ago
- Large language models designed for formal theorem proving through tool-integrated reasoning. ☆33 Aug 13, 2025 Updated 6 months ago
- Generative Modeling with Bayesian Sample Inference ☆24 May 17, 2025 Updated 9 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments. ☆705 Oct 15, 2025 Updated 4 months ago
- [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code" ☆68 Apr 11, 2025 Updated 10 months ago
- [ICLR 2026] Geometric-Mean Policy Optimization ☆100 Jan 26, 2026 Updated last month
- ☆90 Oct 30, 2025 Updated 4 months ago
- [NeurIPS 2025] The official repository for our paper, "Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reason… ☆154 Sep 12, 2025 Updated 5 months ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization ☆39 Feb 7, 2026 Updated 3 weeks ago
- Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging Supervised Learning and Reinforcement Learning in Math Reasonin… ☆71 Sep 8, 2025 Updated 5 months ago
- This is the official implementation for MA-LoT. ☆19 Aug 4, 2025 Updated 7 months ago
- [SIGIR 2025] Benchmarking Recommendation, Classification, and Tracing Based on Hugging Face Knowledge Graph ☆16 Jun 6, 2025 Updated 8 months ago
- ☆16 Sep 17, 2024 Updated last year
- OpenS2S: Advancing Fully Open-Source End-to-End Empathetic Large Speech Language Model ☆112 Jul 17, 2025 Updated 7 months ago
- 🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025] ☆1,172 Nov 17, 2025 Updated 3 months ago
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning ☆46 Jul 17, 2025 Updated 7 months ago
- Project page of "GRAM: Generative Radiance Manifolds for 3D-Aware Image Generation" ☆21 Apr 3, 2023 Updated 2 years ago
- Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?" ☆64 Dec 10, 2025 Updated 2 months ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25] ☆34 Sep 1, 2025 Updated 6 months ago
- Scaling In-context Learning from Few-shot to 1,024-shot on Tabular ML ☆59 Dec 12, 2025 Updated 2 months ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025] ☆180 Jul 8, 2025 Updated 7 months ago
- ThinkDepth.ai Deep Research ☆176 Jan 5, 2026 Updated last month
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks ☆50 Sep 4, 2025 Updated 6 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL ☆4,085 Nov 13, 2025 Updated 3 months ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions". ☆15 Feb 9, 2026 Updated 3 weeks ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters" ☆15 May 18, 2025 Updated 9 months ago
- [ACL2025 Oral & Award] Evaluate Image/Video Generation like Humans - Fast, Explainable, Flexible ☆121 Aug 10, 2025 Updated 6 months ago
- ☆20 Jul 23, 2025 Updated 7 months ago