StarRing2022 / R1-NatureLinks

最简易的R1结果在小模型上的复现，阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证，对于强推理能力，think思考过程性内容是AGI/ASI的核心。

☆45

Alternatives and similar repositories for R1-Nature

Users that are interested in R1-Nature are comparing it to the libraries listed below

Sorting:

Bui1dMySea / MemLong
☆95Updated 10 months ago
Liuziyu77 / Soda
Search, organize, discover anything!
☆48Updated last year
thunlp / Delta-CoMe
Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024
☆57Updated 11 months ago
Alibaba-NLP / MaskSearch
Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"
☆146Updated 4 months ago
zhaochenyang20 / Prompt2Model-Self-Guide
SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper
☆33Updated last year
zai-org / GLM-Edge
GLM Series Edge Models
☆149Updated 4 months ago
MikeGu721 / AgentGroup
☆94Updated last year
Gen-Verse / ScoreFlow
Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"
☆87Updated 5 months ago
shibing624 / open-o1
open-o1: Using GPT-4o with CoT to Create o1-like Reasoning Chains
☆116Updated 9 months ago
icip-cas / DeepSolution
DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking
☆49Updated 7 months ago
bigai-nlco / TokenSwift
[ICML 2025] |TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation
☆115Updated 5 months ago
KwaiKEG / CogGPT
Unleashing the Power of Cognitive Dynamics on Large Language Models
☆63Updated last year
Zheng0428 / COIG-Kun
☆36Updated last year
xverse-ai / XVERSE-V-13B
☆79Updated last year
xverse-ai / XVERSE-MoE-A36B
XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.
☆38Updated last year
Chen-GX / C-3PO
[ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…
☆40Updated 5 months ago
Fu-Dayuan / AgentRefine
(ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning
☆18Updated 8 months ago
bingreeky / GMemory
☆98Updated last week
krystalan / DRT
Deep Reasoning Translation (DRT) Project
☆233Updated last month
MiroMindAI / MiroMind-M1
MiroMind-M1 is a fully open-source series of reasoning language models built on Qwen-2.5, focused on advancing mathematical reasoning.
☆236Updated 2 months ago
RUC-NLPIR / HiRA
The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search
☆59Updated 3 months ago
cooper12121 / llama3-8x8b-MoE
Copy the MLP of llama3 8 times as 8 experts , created a router with random initialization,add load balancing loss to construct an 8x8b Mo…
☆27Updated last year
FlagOpen / Infinity-Instruct
☆49Updated last year
WePOINTS / WePOINTS
☆186Updated 8 months ago
MonolithFoundation / Bumblebee
A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.
☆38Updated last year
hzy312 / knowledge-r1
IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent
☆66Updated 5 months ago
FreedomIntelligence / FastLLM
Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];
☆41Updated last year
FudanNLPLAB / MouSi
☆74Updated last year
vaew / SkyScript-100M
SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama: https://arxiv.org/abs/2408.09333v2
☆127Updated 11 months ago
xverse-ai / XVERSE-MoE-A4.2B
XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.
☆39Updated last year