FareedKhan-dev / train-deepseek-r1View external linksLinks
Building DeepSeek R1 from Scratch
☆744Mar 21, 2025Updated 10 months ago
Alternatives and similar repositories for train-deepseek-r1
Users that are interested in train-deepseek-r1 are comparing it to the libraries listed below
Sorting:
- A Straightforward, Step-by-Step Implementation of a Video Diffusion Model☆75Aug 18, 2025Updated 5 months ago
- A straightforward method for training your LLM, from downloading data to generating text.☆515Aug 3, 2025Updated 6 months ago
- DeepSeek 系列工作解读、扩展和复现。☆700Mar 29, 2025Updated 10 months ago
- 该系列的目的是让读者可以 在基础的pytorch上,不依赖任何其他现成的外部库,从零开始理解并实现一个大语言模型的所有组成部分,以及训练微调代码,因此读者仅需python,pytorch和最基础深度学习背景知识即可。☆379Aug 28, 2025Updated 5 months ago
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆77Apr 4, 2025Updated 10 months ago
- An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to infer…☆798Mar 13, 2025Updated 11 months ago
- Large Language Model in Action☆342Jan 28, 2025Updated last year
- Implementation of 12 AI agents evaluation techniques☆35Jul 31, 2025Updated 6 months ago
- Fully open reproduction of DeepSeek-R1☆25,866Nov 24, 2025Updated 2 months ago
- Fetch arxiv data to LLM-friendly text☆128Jan 31, 2026Updated 2 weeks ago
- LLMs-from-scratch项目中文翻译☆2,305Oct 15, 2025Updated 3 months ago
- Maximizing the Performance of a Simple RAG using RL☆90Mar 20, 2025Updated 10 months ago
- 🧑🚀 全世界最好的LLM资料总结(多模态生成、Agent、辅助编程、AI审稿、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型) | Summary of the world's best LLM resources.☆7,501Feb 6, 2026Updated last week
- Implementation of all RL algorithms in a simpler way☆1,393Aug 29, 2025Updated 5 months ago
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆200Aug 23, 2024Updated last year
- ☆134Feb 17, 2025Updated 11 months ago
- Awesome Reasoning LLM Tutorial/Survey/Guide☆2,286Oct 14, 2025Updated 4 months ago
- support BM25+vecetor☆29May 26, 2025Updated 8 months ago
- ☆762Dec 23, 2025Updated last month
- Creating the DeepSeek V3 model from scratch☆24Mar 28, 2025Updated 10 months ago
- Minimal reproduction of DeepSeek R1-Zero☆12,715Apr 24, 2025Updated 9 months ago
- ☆19Jul 21, 2025Updated 6 months ago
- recursive rag with r1 reasoning☆330May 21, 2025Updated 8 months ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22May 9, 2025Updated 9 months ago
- Open replication of DeepSeek R1 for text-to-graph extraction.☆99Jan 31, 2025Updated last year
- A very simple GRPO implement for reproducing r1-like LLM thinking.☆1,575Nov 21, 2025Updated 2 months ago
- This repository is a collection of legal instruction datasets☆26Jul 12, 2024Updated last year
- [ACL 2025 Demo] Repository for the demo and paper: ReasonGraph: Visualisation of Reasoning Paths☆512May 23, 2025Updated 8 months ago
- 从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)☆527Mar 23, 2025Updated 10 months ago
- Official Repo for Open-Reasoner-Zero☆2,087Jun 2, 2025Updated 8 months ago
- An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)☆8,989Feb 6, 2026Updated last week
- 🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!☆39,326Feb 6, 2026Updated last week
- Reproduce R1 Zero on Logic Puzzle☆2,432Mar 20, 2025Updated 10 months ago
- Democratizing Reinforcement Learning for LLMs☆5,081Feb 7, 2026Updated last week
- Awesome Deep Research list! For more details, please refer to our survey paper -- A Comprehensive Survey of Deep Research: Systems, Metho…☆406Oct 22, 2025Updated 3 months ago
- Deepseek-r1复现科普与资源汇总☆22Mar 5, 2025Updated 11 months ago
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (…☆12,594Updated this week
- ☆12Jan 9, 2024Updated 2 years ago
- O1 Replication Journey☆2,000Jan 14, 2025Updated last year