sunshine-JLU / deepseek-r1-distill-llama-8b-lora
The objective of this project is to demonstrate how to fine-tune deepseek-r1-distill-llama-8b.
☆15Updated 2 months ago
Alternatives and similar repositories for deepseek-r1-distill-llama-8b-lora:
Users that are interested in deepseek-r1-distill-llama-8b-lora are comparing it to the libraries listed below
- ☆38Updated last month
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆22Updated 9 months ago
- fine-tune deepseek r1☆119Updated 2 months ago
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or…☆52Updated 3 months ago
- 基于qwenvl微调一个多模态Xray识别的大模型☆14Updated 6 months ago
- 使用煤矿历史事故案例,事故处理报告、安全规程规章制度、技术文档、煤矿从业人员入职考试题库等数据,微调internlm2模型实现针对煤矿事故和煤矿安全知识的智能问答。☆47Updated 3 months ago
- run chatglm3-6b in BM1684X☆38Updated last year
- 视频理解:千问视频多模态模型 & Dify☆52Updated 7 months ago
- ☆26Updated 6 months ago
- 一些大语言模型和多模态模型的应用,主要包括Rag,小模型,Agent,跨模态搜索,OCR等等☆164Updated 5 months ago
- Here is a demo for PDF parser (Including OCR, object detection tools)☆34Updated 6 months ago
- 基于ChatGLM3基座模型和LLAMA-Factory框架进行微调的一个中医问答机器人☆85Updated last year
- bisheng model services backend☆27Updated 9 months ago
- 想要从零开始训练一个中文的mini大语言模型,可以进行基本的对话,模型大小根据手头的机器决定☆59Updated 8 months ago
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆57Updated 11 months ago
- ☆58Updated 6 months ago
- 使用FastAPI+vLLM部署Qwen2.5☆12Updated 6 months ago
- Qwen-Efficient-Tuning☆43Updated last year
- 集成了LLM与SDXL的AIGC应用程序☆27Updated last year
- RAG向量召回示例☆123Updated last year
- 拼好RAG:手搓并融合了GraphRAG、LightRAG、Neo4j-llm-graph-builder进行知识图谱构建以及搜索;整合DeepSearch技术实现私域RAG的推理;自制针对GraphRAG的评估框架| Integrate GraphRAG, LightRA…☆103Updated this week
- 本项目致力于为大模型领域的初学者提供全面的知识体系,包括基础和高阶内容,以便开发者能迅速掌握大模型技术栈并全面了解相关知识。☆56Updated 3 months ago
- qwen ai agent☆130Updated last year
- 补充了一些Visualglm缺少的文件,可以对Visualglm进行训练,实例中是对人脸做了面相的识别☆13Updated last year
- 通义千问的DPO训练☆47Updated 7 months ago
- 眼科问诊大模型☆90Updated 9 months ago
- mcp的webui界面,支持客户端连接多个sse服务端,支持 openai、deepseek、qwen等大模型,另外附上构建的 agent的 stdio和sse的简单 天气查询的完整示例☆14Updated this week
- ✏️0成本LLM微调上手项目,⚡️一步一步使用colab训练法律LLM,基于microsoft/phi-1_5、chatglm3,包含lora微调,全参微调☆72Updated last year
- 基于Llamaindex微调qwen2.5-7b☆22Updated 4 months ago
- Python3 package for Chinese/English OCR, with paddleocr-v4 onnx model(~14MB). 基于ppocr-v4-onnx模型推理,可实现 CPU 上毫秒级的 OCR 精准预测,通用场景中英文OCR达到开源SO…☆76Updated 3 months ago