FareedKhan-dev / train-deepseek-r1
Building DeepSeek R1 from Scratch
☆592Updated last month
Alternatives and similar repositories for train-deepseek-r1
Users that are interested in train-deepseek-r1 are comparing it to the libraries listed below
Sorting:
- ☆691Updated last month
- DeepSeek 系列工作解读、扩展和复现。☆643Updated last month
- An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to infer…☆682Updated 2 months ago
- 这是一个从头训练大语言模型的项目,包括预训练、微调和直接偏好优化,模型拥有1B参数,支持中英文。☆392Updated 2 months ago
- 一个手把手教你从零开始编写GPT并训练大语言模型的教程☆78Updated 3 months ago
- Train a 1B LLM with 1T tokens from scratch by personal☆643Updated 2 weeks ago
- A very simple GRPO implement for reproducing r1-like LLM thinking.☆1,041Updated last month
- Official Repo for Open-Reasoner-Zero☆1,916Updated last month
- Distributed RL System for LLM Reasoning☆1,248Updated 2 weeks ago
- Collect every awesome work about r1!☆363Updated 2 weeks ago
- minimal-cost for training 0.5B R1-Zero☆719Updated this week
- A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.☆802Updated 2 months ago
- ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning☆847Updated 2 weeks ago
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi…☆236Updated 6 months ago
- An Open-source RL System from ByteDance Seed and Tsinghua AIR☆1,234Updated this week
- Reproduce R1 Zero on Logic Puzzle☆2,337Updated last month
- ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆472Updated last month
- Build & Optimize your RAG.☆647Updated this week
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆521Updated 3 weeks ago
- Scalable RL solution for advanced reasoning of language models☆1,552Updated last month
- 从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)☆430Updated last month
- Unsloth Fine-tuning Notebooks for Google Colab, Kaggle, Hugging Face and more.☆311Updated this week
- 中文翻译的 Hands-On-Large-Language-Models (hands-on-llms),动手学习大模型☆881Updated 3 months ago
- ☆148Updated 2 weeks ago
- Awesome Reasoning LLM Tutorial/Survey/Guide☆1,605Updated last month
- 从零实现一个小参数量中文大语言模型。☆629Updated 8 months ago
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models☆1,767Updated 3 months ago
- Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents☆454Updated 4 months ago
- 企业级RAG系统从入门到精通☆456Updated 2 months ago
- LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.☆767Updated this week