sunshine-JLU / deepseek-r1-distill-llama-8b-loraLinks
The objective of this project is to demonstrate how to fine-tune deepseek-r1-distill-llama-8b.
☆16Updated 10 months ago
Alternatives and similar repositories for deepseek-r1-distill-llama-8b-lora
Users that are interested in deepseek-r1-distill-llama-8b-lora are comparing it to the libraries listed below
Sorting:
- fine-tune deepseek r1☆125Updated 11 months ago
- ☆57Updated 10 months ago
- 基于Qwen2+SFT+DPO的医疗问答系统,项目中使用了自定义的 SFTTrainer/DPOTrainer/TRPOTrainer用于训练,其次,项目还调用各种知识库工具(neo4j, milvus, LDA, 等)进行自动化训练数据生成。另外,使用 vllm 用于推理…☆48Updated last week
- 基于qwenvl微调一个多模态Xray识别的大模型☆21Updated last year
- 基于Llamaindex微调qwen2.5-7b☆34Updated last year
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆27Updated last year
- 训练自己的中文 Embedding 模型☆27Updated last year
- 基于Qwen2模型进行通用信息抽取【实体/关系/事件抽取】☆40Updated last year
- 一个基于多模态向量模型及视觉多模态模型构建的图片搜索引擎&管理系统,实现精准的以文搜文,文搜图、以图搜图多种智能检索方式。An image search engine management system built upon multimodal vector models…☆76Updated 3 months ago
- ☆15Updated last year
- 大模型智能体Agent中文教程,博客代码仓库☆54Updated 2 months ago
- 此项目用于自动化采集、处理和可视化医疗问答数据,可助力构建高质量医疗问答对数据集。同时提供使用预处理后 的数据集对Qwen-7B-Chat进行微调的详细说明。☆23Updated last year
- 在RAG技术中,嵌入向量的生成和匹配是关键环节。本文介绍了一种基于CLIP/BLIP模型的嵌入服务,该服务支持文本和图像的嵌入生成与相似度计算,为多模态信息检索提供了基础能力。☆40Updated last year
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆69Updated last year
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or…☆60Updated 7 months ago
- 中文世界的NLP自动标注开源工具,简单样本,交给LabelFast。☆85Updated last month
- ✏️0成本LLM微调上手项目,⚡️一步一步使用colab训练法律LLM,基于microsoft/phi-1_5、chatglm3,包含lora微调,全参微调☆83Updated 2 years ago
- 视频理解:千问视频多模态模型 & Dify☆66Updated last year
- 使用煤矿历史事故案例,事故处理报告、安全规程规章制度、技术文档、煤矿从业人员入职考试题库等数据,微调internlm2模型实现针对煤矿事故和煤矿安全知识的智能问答。☆58Updated last year
- 基于LangGraph开发的智能体项目,可借助大模型自动调用工具规划旅游行程,包括景点搜索、交通查询、饭店酒店查询等功能☆38Updated last year
- 基于ChatGLM3基座模型和LLAMA-Factory框架进行微调的一个中医问答机器人☆106Updated 2 years ago
- ☆28Updated last year
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆70Updated last year
- YiZhao: A 2TB Open Financial Corpus. Data and tools for generating and inspecting YiZhao, a safe, high-quality, open-source bilingual fin…☆36Updated 6 months ago
- ☆28Updated last year
- baichuan-7B 微调 C++ 面试大模型☆14Updated 2 years ago
- ModelScope+Transformers+SwanLab实现Qwen-1.5-7b的指令微调任务☆23Updated last year
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- Build a simple basic multimodal large model from scratch. 从零搭建一个简单的基础多模态大模型🤖☆47Updated last year
- 筱可的工程实验仓库!☆104Updated 2 months ago