sunshine-JLU / deepseek-r1-distill-llama-8b-lora
This project demonstrates how to fine-tune deepseek-r1-distill-llama-8b.
☆16 Updated 2 months ago
Alternatives and similar repositories for deepseek-r1-distill-llama-8b-lora
Users who are interested in deepseek-r1-distill-llama-8b-lora compare it to the repositories listed below.
- ☆40 Updated 2 months ago
- run chatglm3-6b in BM1684X ☆38 Updated last year
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or… ☆54 Updated 3 months ago
- fine-tune deepseek r1 ☆120 Updated 3 months ago
- ☆78 Updated this week
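- In RAG, generating and matching embedding vectors is a key step. This article introduces an embedding service based on CLIP/BLIP models that supports embedding generation and similarity computation for text and images, providing a foundation for multimodal information retrieval. ☆24 Updated 4 months ago
- Fine-tuning qwen2.5-7b with Llamaindex ☆23 Updated 4 months ago
- Address extraction for regions using multiple agents ☆27 Updated last month
- This project aims to provide a complete closed-loop example of fine-tuning and applying a vertical-domain large model for hotel recommendation, as a reference case. The base model is Qwen2.5-7B-Instruct. Highlights: a complete end-to-end vertical application case, open-source code with walkthroughs, a detailed illustrated guide, and step-by-step demo videos covering the full workflow. ☆36 Updated 3 weeks ago
- The llava-Qwen2-7B-Instruct-Chinese-CLIP model improves Chinese text recognition and the understanding of meme meanings, approaching the recognition level of gpt4o and claude-3.5-sonnet! ☆22 Updated 9 months ago
- Fine-tuning a multimodal large model for X-ray recognition based on qwenvl ☆16 Updated 6 months ago
- Video understanding: Qwen video multimodal model & Dify ☆53 Updated 8 months ago
- This project aims to give beginners in the large-model field a comprehensive body of knowledge, covering both basic and advanced content, so that developers can quickly master the large-model technology stack and gain a full understanding of the field. ☆58 Updated 4 months ago
- 拼好RAG: hand-builds and integrates GraphRAG, LightRAG, and Neo4j-llm-graph-builder for knowledge graph construction and search; incorporates DeepSearch for reasoning over private-domain RAG; includes a custom evaluation framework for GraphRAG | Integrate GraphRAG, LightRA… ☆155 Updated this week
- An image search engine and management system built on multimodal vector models and visual multimodal models, supporting accurate text-to-text, text-to-image, and image-to-image retrieval. An image search engine management system built upon multimodal vector models… ☆23 Updated last week
- For those who want to train a Chinese mini large language model from scratch that can hold basic conversations; the model size depends on the machine at hand. ☆59 Updated 9 months ago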
- Official code for Dynamic Parametric RAG. ☆112 Updated last week
- Build a simple basic multimodal large model from scratch. 🤖 ☆38 Updated 10 months ago
- A large model for C++ interview questions, fine-tuned from baichuan-7B ☆15 Updated last year
- ☆59 Updated last year
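- A large model for ophthalmology consultations ☆91 Updated 10 months ago
- An LLM-based multi-turn question-answering system that combines intent recognition and slot filling ☆19 Updated last year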
- Dify 1.0 Plugin Support MCP Tools Agent strategies ☆69 Updated this week
- ☆58 Updated 6 months ago
- Comparison of large-model API performance metrics: an in-depth analysis of key metrics such as TTFT and TPS ☆17 Updated 8 months ago
- ☆16 Updated 10 months ago
- qwen ai agent ☆131 Updated last year
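- This project presents prompt engineering use cases, including building a simulated intelligent recommendation customer-service system and Q&A, plus advanced demos such as chain of thought, self-consistency, and tree of thoughts, to help readers understand prompting. A single codebase supports multiple large models (such as OpenAI and Alibaba Tongyi Qianwen) and wraps the application as an API using FastAPI. ☆30 Updated 7 months ago
- A framework for training large language models; supports LoRA, full-parameter fine-tuning, etc.; define yaml to start training/fine tune of y… ☆27 Updated 7 months ago
- Welcome to the hands-on project repository of "筱可AI研习社"! This repository stores and showcases the hands-on projects written for the official account. We will keep optimizing and iterating on these projects to explore the possibilities of AI. ☆34 Updated last week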