owenliang / DeepSeek-Distill-Qwen-For-Child
☆38Updated last month
Alternatives and similar repositories for DeepSeek-Distill-Qwen-For-Child:
Users that are interested in DeepSeek-Distill-Qwen-For-Child are comparing it to the libraries listed below
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆57Updated 11 months ago
- 通义千问的DPO训练☆46Updated 7 months ago
- qwen ai agent☆130Updated last year
- 想要从零开始训练一个中文的mini大语言模型,可以进行基本的对话,模型大小根据手头的机器决定☆59Updated 8 months ago
- 使用煤矿历史事故案例,事故处理报告、安全规程规章制度、技术文档、煤矿从业人员入职考试题库等数据,微调internlm2模型实现针对煤矿事故和煤矿安全知识的智能问答。☆47Updated 3 months ago
- 在RAG技术中,嵌入向量的生成和匹配是关键环节。本文介绍了一种基于CLIP/BLIP模型的嵌入服务,该服务支持文本和图像的嵌入生成与相似度计算,为多模态信息检索提供了基础能力。☆22Updated 3 months ago
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆22Updated 9 months ago
- 本项目主要介绍prompt工程相关用例。包括模拟智能推荐客服系统构建和问答、思维链、自洽性、思维树等相关进阶demo,旨在帮助大家理解prompt。通过一份代码实现了同时支持多种大模型(如OpenAI、阿里通义千问等)并使用FastAPI对应用进行API封装。☆29Updated 6 months ago
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or…☆52Updated 3 months ago
- ☆140Updated 11 months ago
- qwen models finetuning☆97Updated last month
- 大模型检索增强生成技术最佳实践。☆73Updated 7 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- 大语言模型训练和服务调研☆37Updated last year
- A unified tool to generate fine-tuning datasets for LLMs, including questions, answers, and dialogues. ✨🤖📚💬☆59Updated last month
- The objective of this project is to demonstrate how to fine-tune deepseek-r1-distill-llama-8b.☆15Updated 2 months ago
- 基于Qwen2模型进行通用信息抽取【实体/关系/事件抽取】☆31Updated 9 months ago
- A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, …☆43Updated last month
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆54Updated 7 months ago
- LLM Tokenizer with BPE algorithm☆31Updated 11 months ago
- 中文大模型微调(LLM-SFT), 数学指令数据集MWP-Instruct, 支持模型(ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B), 支持(LoRA, QLoRA, DeepSpeed, UI, TensorboardX), 支持(微…☆195Updated 11 months ago
- Generate dialog data from documents using LLM like ChatGLM2 or ChatGPT;利用ChatGLM2,ChatGPT等大模型根据文档生成对话数据集☆157Updated last year
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆37Updated 4 months ago
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆107Updated last year
- simple decoder-only GTP model in pytorch☆39Updated 11 months ago
- ☆26Updated 6 months ago
- ThinkLLM:🚀 轻量、高效的大语言模型算法实现☆37Updated last week
- 使用单个24G显卡,从0开始训练LLM☆53Updated 6 months ago
- Qwen-Efficient-Tuning☆43Updated last year
- 本项目致力于为大模型领域的初学者提供全面的知识体系,包括基础和高阶内容,以便开发者能迅速掌握大模型技术栈并全面了解 相关知识。☆56Updated 3 months ago