828Tina/deepseek-llm-7B-chat-lora-ft

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/828Tina/deepseek-llm-7B-chat-lora-ft)

828Tina / deepseek-llm-7B-chat-lora-ft

使用多轮对话数据集对deepseek进行lora微调教程

☆61

Alternatives and similar repositories for deepseek-llm-7B-chat-lora-ft

Users that are interested in deepseek-llm-7B-chat-lora-ft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sunshine-JLU / deepseek-r1-distill-llama-8b-lora
View on GitHub
The objective of this project is to demonstrate how to fine-tune deepseek-r1-distill-llama-8b.
☆17Feb 19, 2025Updated last year
mattppal / Chat-with-Claude-Sonnet-35
View on GitHub
This is a simple guide to help you build an Anthropic Claude Sonnet 3.5 chatbot interface with Gradio
☆13Jun 23, 2024Updated 2 years ago
zjmitxwz / rlsb
View on GitHub
python人脸识别和情绪识别
☆17Oct 2, 2023Updated 2 years ago
997261095 / point-generate
View on GitHub
指针生成网络在中英文数据集下的应用
☆16Mar 10, 2020Updated 6 years ago
shengtaovvv / Dialogue
View on GitHub
本项目由三个模块构成。意图识别：判断用户的意图是业务型还是闲聊型；模型检索：该部分构建一个语料库，当用户发起新的query（通过意图识别判断为业务型对话）时，为用户匹配query检索的最佳response，使用HSWN进行召回（粗排），然后构建句子的相似度，并利用Lig…
☆12Feb 18, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
zhouxy2003 / cn_sentiment_an
View on GitHub
基于 BERT 的中文情感分类任务如何使用 transformers 库和相关工具实现情感分析任务。脚本基于预训练的 BERT 模型（bert-base-chinese），对文本进行分类，标签为正面（positive）、负面（negative）和中性（neutral）。
☆46Oct 18, 2025Updated 9 months ago
CodeDuoGun / deepseek_lora
View on GitHub
基于deepseek、qwen3大模型，lora sft 医疗行业数据
☆15Apr 10, 2026Updated 3 months ago
yinhao0214 / sinaFinanceSpider
View on GitHub
爬取新浪财经网http://finance.sina.com.cn/stock/，各股票公司每日公告（爬取股票分析所需语料）
☆29Aug 9, 2017Updated 8 years ago
zerowsir / stock_study
View on GitHub
股票相关知识学习，用Python来研究一下股票投资，大致会包括股票数据的爬取、技术指标分析、量化交易到神经网络（深度学习）
☆17Aug 3, 2019Updated 6 years ago
xiangruihu / bilibili
View on GitHub
☆15Aug 3, 2025Updated 11 months ago
dotXem / GLYFE
View on GitHub
Benchmark of glucose predictive models in diabetes
☆11Nov 12, 2024Updated last year
pdsuwwz / vite-pinia-starter
View on GitHub
🐝 基础迭代模板 Starter Example 🍍 Pinia + Vue3 + Vite 5 + Element-Plus 2 + ESLint(v9) + Axios + Sass 基于 useLocale 实现 i18n 路由级别国际化语言切换
☆11Updated this week
stay-leave / DeepSearchAcademic
View on GitHub
基于舆情中文核心论文的deepsearch项目
☆15Apr 1, 2025Updated last year
Kirovsiki / DeepSeek_R1_LoraTrain
View on GitHub
用于训练中文DeepSeek R1大模型的Lora脚本
☆13Mar 20, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
ronek22 / runningCalculator
View on GitHub
Console app that calculates interesting things for runners like pace, time,long run distance, vdot, paces for vdot
☆14Sep 30, 2024Updated last year
xiby / FedHealth
View on GitHub
An implementation of FedHealth
☆11May 26, 2021Updated 5 years ago
zhkai / MTSF-DG
View on GitHub
Time Series Forecasting with Dynamic Graph Modeling
☆16Aug 31, 2025Updated 10 months ago
jacy0201 / zb-pay-dubbo
View on GitHub
以【电商购物支付】作为当前分布式项目的业务功能，通过该项目完整实现并解决分布式服务下的【分布式事务】问题
☆17Apr 29, 2018Updated 8 years ago
liuchen6667 / qwen2.5_sft_kd
View on GitHub
对qwen2.5进行微调以及知识蒸馏
☆17Dec 24, 2024Updated last year
zysNLP / quickllm
View on GitHub
A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, …
☆47Oct 8, 2025Updated 9 months ago
colinzhu / web-console
View on GitHub
A super simple java tool that allows you to run a task from a web browser and see the output in real time.
☆12Mar 3, 2025Updated last year
Hyperclaw79 / StocksALot
View on GitHub
StocksALot is a cutting edge PoC for Stock Market Analysis employing OpenAI's GPT LLMs for insight inference.
☆12Dec 6, 2023Updated 2 years ago
eric52zhang / cmliuedge
View on GitHub
edgetunnel参考cmliu大佬的项目并混淆
☆15Updated this week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
taishan1994 / Qwen2-UIE
View on GitHub
基于Qwen2模型进行通用信息抽取【实体/关系/事件抽取】
☆41Jul 10, 2024Updated 2 years ago
anuradha1992 / HEAL
View on GitHub
Code and the dataset for HEAL: A Knowledge Graph for Distress Management Conversations
☆23Nov 5, 2024Updated last year
kaihhe / tianchi-diabetes-challenge
View on GitHub
天池精准医疗大赛，糖尿病预测
☆11Jul 13, 2018Updated 8 years ago
hl845740757 / disruptor2
View on GitHub
重写LMAX的Disruptor，更好的接口，更好的扩展性
☆10Mar 20, 2026Updated 4 months ago
AdamPlatin123 / Docling-webui
View on GitHub
☆12May 20, 2025Updated last year
Breeze648 / WeakWater-30M
View on GitHub
本项目从零开始构建并优化了一个千万参数级别的大规模预训练语言模型，涵盖预训练、有监督微调（SFT）和R1推理蒸馏三个阶段。项目采用自定义Transformer架构（包括RMSNorm、分组注意力、多Query机制、SwiGLU激活和RoPE位置编码），实现高效的长文本处理和…
☆23Mar 10, 2025Updated last year
taishan1994 / baichuan-Qlora-Tuning
View on GitHub
基于qlora对baichuan-7B大模型进行指令微调。
☆22Jun 22, 2023Updated 3 years ago
faruto / rqalpha-mod-futu
View on GitHub
RQAlpha 对接 futuquant 的扩展 Mod。通过启用该 Mod 来实现港股和美股交易策略的实盘交易。
☆13Sep 13, 2017Updated 8 years ago
Bryce-come / Qwen3-MS
View on GitHub
实战-医疗大模型微调
☆20Sep 21, 2025Updated 10 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
ynwynw / bookManage-public
View on GitHub
springboot的前后端分离的图书管理系统项目。后端使用Java+SpringBoot+MyBatis+MySQL 前端使用Vue+Axios+Element UI
☆18Feb 21, 2023Updated 3 years ago
JozeOu / des-algorithm
View on GitHub
C 语言实现 DES 算法
☆17Nov 2, 2018Updated 7 years ago
mysiga / RedWallet
View on GitHub
微信抢红包，支持后台通知和聊天界面抢红包
☆13Mar 8, 2017Updated 9 years ago
SenhLinsh / PriceAnalysisAssistant
View on GitHub
淘宝、京东宝贝价格分析助手，一键保存当前所收藏宝贝的价格实时价格，以后购买前可分析当前价格浮动，便于理性购买！
☆12Dec 7, 2017Updated 8 years ago
fabbrimatteo / VHA
View on GitHub
This repository contains the source code related to the paper Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation
☆11Jun 23, 2020Updated 6 years ago
CXR-AL14 / CXR-Code
View on GitHub
☆12Sep 23, 2022Updated 3 years ago
myh-1302 / Multimodal-emotion
View on GitHub
该系统通过融合文本、音频和视频数据，实现了情绪状态的准确识别，涉及的关键技术包括BERT模型、YOLOv8目标检测和多模态学习算法，为情感交互提供了强有力的技术支撑。
☆23Sep 8, 2024Updated last year