Zeyi-Lin / easy-r1
Train deepseek r1-like reasoning LLM with ease | 轻松训练1个deepseek r1类的推理LLM
☆14Updated 2 months ago
Alternatives and similar repositories for easy-r1:
Users that are interested in easy-r1 are comparing it to the libraries listed below
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or…☆52Updated 3 months ago
- 大语言模型训练和服务调研☆37Updated last year
- ThinkLLM:🚀 轻量、高效的大语言模型算法实现☆37Updated last week
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆56Updated last year
- LLM RAG 应用,支持 API 调用,语音交互。☆11Updated 10 months ago
- deepseek思维树模式实现☆14Updated 2 months ago
- 本项目致力于为大模型领域的初学者提供全面的知识体系,包括基础和高阶内容,以便开发者能迅速掌握大模型技术栈并全面了解相关知识。☆56Updated 3 months ago
- flow mirror models from JZX AI Labs☆45Updated 6 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- 微调阿里开源的文字检测模型,利用合合识别返回的OCR结果作为初始训练数据,对模型进行优化训练,使其更加适应1万张图片的具体场景,提高文字识别的精度。☆9Updated 4 months ago
- 基于DPO算法微调语言大模型,简单好上手。☆37Updated 9 months ago
- A Multi-modal RAG Project with Dataset from Honor of Kings, one of the most popular smart phone games in China☆65Updated 7 months ago
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆44Updated last year
- unify-easy-llm(ULM)旨在打造一个简易的一键式大模型训练工具,支持Nvidia GPU、Ascend NPU等不同硬件以及常用的大模型。☆55Updated 8 months ago
- An AI-powered content conversion tool that transforms text, web content, or HTML code into beautifully designed card images.一款基于AI的内容转换工…☆13Updated last week
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆37Updated 7 months ago
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆37Updated 4 months ago
- AGM阿格姆:AI基因图谱模型,从token-weight权重微粒角度,探索AI模型,GPT\LLM大模型的内在运作机制。☆28Updated last year
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆44Updated 2 months ago
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆47Updated last year
- Imitate OpenAI with Local Models☆88Updated 7 months ago
- LLM+RAG for QA☆21Updated last year
- ☆20Updated 10 months ago
- Gemma-SFT, gemma-2b/gemma-7b微调(finetune,transformers)/LORA(peft)/推理(inference)☆30Updated 11 months ago
- KDD2024-WhoIsWho-Top3☆16Updated 10 months ago
- ☆140Updated 11 months ago
- Music large model based on InternLM2-chat.☆22Updated 4 months ago
- 代码大模型 预训练&微调&DPO 数据处理 业界处理pipeline sota☆38Updated 9 months ago
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆57Updated 11 months ago
- 想要从零开始训练一个中文的mini大语言模型,可以进行基本的对话,模型大小根据手头的机器决定☆59Updated 8 months ago