Tutorial on LoRA fine-tuning DeepSeek with a multi-turn dialogue dataset
☆60Dec 26, 2024Updated last year
Alternatives and similar repositories for deepseek-llm-7B-chat-lora-ft
Users that are interested in deepseek-llm-7B-chat-lora-ft are comparing it to the libraries listed below.
- The objective of this project is to demonstrate how to fine-tune deepseek-r1-distill-llama-8b.☆17Feb 19, 2025Updated last year
- Notes for the ACwing basic algorithms course☆11May 15, 2023Updated 3 years ago
- ☆10Feb 17, 2022Updated 4 years ago
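- This project builds and optimizes from scratch a large-scale pretrained language model with tens of millions of parameters, covering three stages: pretraining, supervised fine-tuning (SFT), and R1 reasoning distillation. It uses a custom Transformer architecture (including RMSNorm, grouped attention, a multi-query mechanism, SwiGLU activation, and RoPE positional encoding) to achieve efficient long-text processing and…☆22Mar 10, 2025Updated last year
- Scrapes daily announcements of listed companies from Sina Finance (http://finance.sina.com.cn/stock/) to collect a corpus for stock analysis☆28Aug 9, 2017Updated 8 years ago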
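- A deepsearch project based on Chinese core-journal papers on public opinion☆15Apr 1, 2025Updated last year
- A collection of papers on deep learning interpretability☆15Mar 3, 2021Updated 5 years ago
- A BERT-based Chinese sentiment classification task, showing how to implement sentiment analysis with the transformers library and related tools. The script uses the pretrained BERT model (bert-base-chinese) to classify text as positive, negative, or neutral.☆46Oct 18, 2025Updated 7 months ago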
- This AI Agent retrieves the latest news articles based on a multi keyword using the Serp API. It processes the results and returns struct…☆11Jan 31, 2025Updated last year
- Benchmark of glucose predictive models in diabetes☆11Nov 12, 2024Updated last year
- ☆11Mar 1, 2016Updated 10 years ago
- 🐝 Basic iteration template (Starter Example) 🍍 Pinia + Vue3 + Vite 5 + Element-Plus 2 + ESLint(v9) + Axios + Sass, with route-level i18n language switching implemented via useLocale☆11Updated this week
- Console app that calculates interesting things for runners like pace, time, long run distance, vdot, paces for vdot☆14Sep 30, 2024Updated last year
- Repository for the Findings of ACL'23 paper Label Agnostic Pre-training for Zero-shot Text Classification☆12Aug 10, 2023Updated 2 years ago
- Code for ACL 2023 main conference paper "Understanding and Bridging the Modality Gap for Speech Translation".☆16Oct 25, 2023Updated 2 years ago
- USTC Spring 2023: Intelligent Computing Systems☆11Jul 18, 2023Updated 2 years ago
- DCNv2_torch1.11☆11Sep 27, 2022Updated 3 years ago
- An implementation of FedHealth☆11May 26, 2021Updated 5 years ago
- A repo for updating and debugging Mixtral-7x8B, MOE, ChatGLM3, LLaMa2, BaChuan, Qwen and other LLM models, including new models mixtral, mixtral 8x7b, …☆47Oct 8, 2025Updated 8 months ago
- Codes for paper : "A Stroke-based RNN for Writer-Independent Online Signature Verification"☆11May 6, 2019Updated 7 years ago
- Relation extraction with Efficient-GlobalPointer☆24Jan 27, 2022Updated 4 years ago
- ☆20Jul 8, 2024Updated last year
- collected real time walking data(patterns) with gyroscope, use Fast Fourier Transformation to extract the clustering features, and build …☆10Mar 2, 2020Updated 6 years ago
- Demo code showing how to use Java's StructuredTaskScope☆11Dec 10, 2025Updated 5 months ago
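- An extension Mod that connects RQAlpha to futuquant. Enabling this Mod allows live trading of Hong Kong and US stock strategies.☆13Sep 13, 2017Updated 8 years ago
- DBNet text detection with added text-box classification☆14Jul 27, 2022Updated 3 years ago
- Identifies US stocks with exceptionally large gains in recent years and finds related stocks in the A-share market☆13Jan 31, 2016Updated 10 years ago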
- Simple implementation of Retrieval-Augmented Generation System☆28Oct 24, 2024Updated last year
- Machine learning strategy that trains the model using "everything and the kitchen sink": fundamentals, technical indicators, returns, pri…☆14Apr 23, 2024Updated 2 years ago
- Code for ACL 2022 main conference paper "Neural Machine Translation with Phrase-Level Universal Visual Representations".☆21Oct 25, 2023Updated 2 years ago
- Tianchi Precision Medicine Competition: diabetes prediction☆11Jul 13, 2018Updated 7 years ago
- Sample project for cloud development AI capabilities (mini program)☆14Feb 17, 2025Updated last year
- StocksALot is a cutting edge PoC for Stock Market Analysis employing OpenAI's GPT LLMs for insight inference.☆11Dec 6, 2023Updated 2 years ago
- 爱淘优惠券 (Aitao coupons)☆11Sep 14, 2020Updated 5 years ago
- Works through the contents of the book 《实战Google深度学习框架》 (Google Deep Learning Framework in Practice)☆20Oct 6, 2018Updated 7 years ago
- Open-source API for financial data. Get quotes, historical data, technical indicators, and more.☆36Updated this week
- Building a multi-agent customer service system with LangGraph☆43May 26, 2026Updated 2 weeks ago
- A rewrite of LMAX's Disruptor with a better interface and better extensibility☆10Mar 20, 2026Updated 2 months ago
- colab list for image☆19Apr 15, 2026Updated last month