Zeyi-Lin / easy-r1
Train deepseek r1-like reasoning LLM with ease | 轻松训练1个deepseek r1类的推理LLM
☆12Updated last month
Alternatives and similar repositories for easy-r1:
Users that are interested in easy-r1 are comparing it to the libraries listed below
- Music large model based on InternLM2-chat.☆22Updated 3 months ago
- 大语言模型训练和服务调研☆37Updated last year
- the newest version of llama3,source code explained line by line using Chinese☆22Updated 11 months ago
- flow mirror models from JZX AI Labs☆43Updated 6 months ago
- LLM RAG 应用,支持 API 调用,语音交互。☆11Updated 9 months ago
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆37Updated 6 months ago
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or…☆50Updated 2 months ago
- Copy the MLP of llama3 8 times as 8 experts , created a router with random initialization,add load balancing loss to construct an 8x8b Mo…☆26Updated 9 months ago
- GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models☆19Updated last year
- Gemma-SFT, gemma-2b/gemma-7b微调(finetune,transformers)/LORA(peft)/推理(inference)☆30Updated 10 months ago
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆22Updated 8 months ago
- Xtuner Factory☆33Updated last year
- ChatTTS is a generative speech model for daily dialogue.☆14Updated 5 months ago
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆37Updated 3 months ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆56Updated 11 months ago
- ☆30Updated 3 weeks ago
- pre-training llama3 using chinese☆14Updated 11 months ago
- ☆137Updated 10 months ago
- ⛏️This is the storage of my Slides、Reports and Papers. | 存储PPT、报告和论文☆11Updated 5 months ago
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆40Updated last month
- deepseek思维树模式实现☆14Updated last month
- SUS-Chat: Instruction tuning done right☆48Updated last year
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆44Updated 11 months ago
- Awesome Colab Projects Collection☆26Updated last year
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆36Updated 10 months ago
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Updated 11 months ago
- Fast pdf translate是一款pdf翻译软件,基于MinerU实现pdf转markdown的功能,接着对markdown进行分割, 送给大模型翻译,最后组装翻译结果并由pypandoc生成结果pdf。☆12Updated last week
- Tracking the hot Github repos and update daily 每天自动追踪Github热门项目☆47Updated this week
- 模型压缩的小白入门教程☆22Updated 8 months ago
- A more efficient GLM implementation!☆55Updated 2 years ago