Zeyi-Lin / easy-r1Links
Train deepseek r1-like reasoning LLM with ease | 轻松训练1个deepseek r1类的推理LLM
☆16Updated 4 months ago
Alternatives and similar repositories for easy-r1
Users that are interested in easy-r1 are comparing it to the libraries listed below
Sorting:
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆45Updated 4 months ago
- Music large model based on InternLM2-chat.☆22Updated 6 months ago
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆63Updated last year
- SUS-Chat: Instruction tuning done right☆48Updated last year
- ⛏️This is the storage of my Slides、Reports and Papers. | 存储PPT、报告和论文☆11Updated 8 months ago
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or…☆58Updated 3 weeks ago
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆23Updated 11 months ago
- deepseek思维树模式实现☆15Updated 4 months ago
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆37Updated 9 months ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆57Updated last year
- LLM+RAG for QA☆22Updated last year
- ☆22Updated 4 months ago
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆39Updated last year
- [ACL 2025] An official pytorch implement of the paper: Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement☆30Updated 3 weeks ago
- accelerate generating vector by using onnx model☆17Updated last year
- Xtuner Factory☆33Updated last year
- LLM RAG 应用,支持 API 调用,语音交互。☆11Updated last year
- (撰写ing..)本仓库偏教程性质,以「模型中文化」为一个典型的模型训练问题切入场景,指导读者上手学习LLM二次微调训练。☆34Updated 10 months ago
- llms related stuff , including code, docs☆13Updated 4 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆39Updated last year
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆38Updated 6 months ago
- ☆16Updated 11 months ago
- 大语言模型训练和服务调研☆37Updated last year
- LLM Tokenizer with BPE algorithm☆32Updated last year
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆44Updated last year
- A Multi-modal RAG Project with Dataset from Honor of Kings, one of the most popular smart phone games in China☆66Updated 9 months ago
- A knowledge base backend system for LLMs with full-text search, semantic retrieval, and knowledge graph querying. Ready-to-use modules fo…☆27Updated 2 months ago
- Gemma-SFT, gemma-2b/gemma-7b微调(finetune,transformers)/LORA(peft)/推理(inference)☆31Updated last year
- a toolkit on knowledge distillation for large language models☆89Updated 2 weeks ago