WALLE-AI / uReasoningLLMsLinks
Deepseek-r1复现科普与资源汇总
☆22Updated 11 months ago
Alternatives and similar repositories for uReasoningLLMs
Users that are interested in uReasoningLLMs are comparing it to the libraries listed below
Sorting:
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆43Updated last year
- 大语言模型训练和服务调研☆37Updated 2 years ago
- LLM+RAG for QA☆22Updated 2 years ago
- Fast instruction tuning with Llama2☆11Updated last year
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆71Updated last year
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- Max的有趣数据集 / Max's awesome datasets☆61Updated 5 months ago
- 天池算法比赛《BetterMixture - 大模型数据混合挑 战赛》的第一名top1解决方案☆34Updated last year
- 微调阿里开源的文字检测模型,利用合合识别返回的OCR结果作为初始训练数据,对模型进行优化训练,使其更加适应1万张图片的具体场景,提高文字识别的精度。☆10Updated last year
- Code for Robust Fine-tuning (RbFT)☆17Updated last year
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Updated last year
- 数据合成工具,简单高效的合成不同业务场景的大模型训练数据☆38Updated last year
- Code for "An Empirical Study of Retrieval Augmented Generation with Chain-of-Thought"☆17Updated last year
- 本项目用于大模型数学解题能力方面的数据集合成,模型训练及评测,相关文章记录。☆100Updated last year
- ☆28Updated last year
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- 怎么训练一个LLM分词器☆153Updated 2 years ago
- 2025.01:从零到一实现了一个多模态大模型,并命名为Reyes(睿视),R:睿,eyes:眼。Reyes的参数量为8B,视觉编码器使用的是InternViT-300M-448px-V2_5,语言模型侧使用的是Qwen2.5-7B-Instruct,Reyes也通过一个两…☆29Updated last week
- ☆36Updated last year
- Music large model based on InternLM2-chat.☆23Updated last year
- ☆25Updated 9 months ago
- ☆125Updated last year
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆58Updated last year
- 想要从零开始训练一个中文的mini大语言模型,可以进行基本的对话,模型大小根据手头的机器决定☆65Updated last year
- This repo offers advanced tutorials for LLMs, BERT-based models, and multimodal models, covering fine-tuning, quantization, vocabulary ex…☆24Updated 9 months ago
- ☆13Updated 10 months ago
- This repository provides an implementation of "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction B…☆86Updated 6 months ago
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆45Updated last year
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆48Updated last year
- ☆51Updated last year