Tutorial on LoRA fine-tuning of deepseek using a multi-turn dialogue dataset
☆61Dec 26, 2024Updated last year
Alternatives and similar repositories for deepseek-llm-7B-chat-lora-ft
Users that are interested in deepseek-llm-7B-chat-lora-ft are comparing it to the libraries listed below.
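- Fine-tuning a small deepseek model with LoRA☆22Nov 15, 2024Updated last year
- PyTorch implementation for "Federated Recommendation via Hybrid Retrieval Augmented Generation".☆23Mar 8, 2024Updated 2 years ago
- This project builds and optimizes, from scratch, a pre-trained language model at the ten-million-parameter scale, covering three stages: pre-training, supervised fine-tuning (SFT), and R1 reasoning distillation. It uses a custom Transformer architecture (including RMSNorm, grouped attention, a multi-Query mechanism, SwiGLU activation, and RoPE positional encoding) to achieve efficient long-text processing and…☆22Mar 10, 2025Updated last year
- Scrapes the daily announcements of listed companies from Sina Finance, http://finance.sina.com.cn/stock/ (a corpus for stock analysis)☆29Aug 9, 2017Updated 8 years ago
- A deepsearch project based on Chinese core-journal papers on public opinion☆15Apr 1, 2025Updated last year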
- This AI Agent retrieves the latest news articles based on a multi keyword using the Serp API. It processes the results and returns struct…☆11Jan 31, 2025Updated last year
- ☆11Mar 1, 2016Updated 10 years ago
- Repository for the Findings of ACL'23 paper Label Agnostic Pre-training for Zero-shot Text Classification☆12Aug 10, 2023Updated 2 years ago
- Code for ACL 2023 main conference paper "Understanding and Bridging the Modality Gap for Speech Translation".☆16Oct 25, 2023Updated 2 years ago
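- CBLUE2.0 relation extraction model, based on PyTorch☆18Oct 23, 2024Updated last year
- Uses [e-commerce shopping payment] as the business function of a distributed project, and through it fully implements and solves the [distributed transaction] problem in distributed services☆17Apr 29, 2018Updated 8 years ago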
- ☆14Apr 4, 2025Updated last year
- Code for the paper "A Stroke-based RNN for Writer-Independent Online Signature Verification"☆11May 6, 2019Updated 7 years ago
- A repo for updating and debugging Mixtral-7x8B, MOE, ChatGLM3, LLaMa2, BaChuan, Qwen and other LLM models, including new models mixtral, mixtral 8x7b, …☆47Oct 8, 2025Updated 8 months ago
- Collects real-time walking data (patterns) with a gyroscope, uses the Fast Fourier Transform to extract clustering features, and builds …☆10Mar 2, 2020Updated 6 years ago
- Demo code showing how to use Java's StructuredTaskScope☆11Jun 9, 2026Updated 2 weeks ago
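- An extension Mod that connects RQAlpha to futuquant. Enabling this Mod allows live trading of Hong Kong and US stock strategies.☆13Sep 13, 2017Updated 8 years ago
- Identifies US stocks with especially large gains in recent years and finds related stocks in the A-share market☆13Jan 31, 2016Updated 10 years ago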
- Machine learning strategy that trains the model using "everything and the kitchen sink": fundamentals, technical indicators, returns, pri…☆14Apr 23, 2024Updated 2 years ago
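- Sample project for cloud development AI capabilities (mini program)☆14Feb 17, 2025Updated last year
- Works through the contents of the book 《实战Google深度学习框架》☆20Oct 6, 2018Updated 7 years ago
- A multi-agent customer service system built with LangGraph☆50May 26, 2026Updated last month
- Price analysis assistant for Taobao and JD items: saves the real-time prices of all currently favorited items in one click, so that price fluctuations can be analyzed before a later purchase, making for more rational buying!☆12Dec 7, 2017Updated 8 years ago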
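- Guandan (掼蛋) AI☆13Oct 18, 2020Updated 5 years ago
- Listens to the MySQL binlog to generate, in real time, the incremental indexes used by GAE☆11Jan 24, 2018Updated 8 years ago
- WeChat red-envelope grabber, supporting grabbing from background notifications and from the chat screen☆13Mar 8, 2017Updated 9 years ago
- Generates text animation videos from text input☆17Apr 14, 2022Updated 4 years ago
- Auto-generates subtitles for any video or audio file using Baidu ASR (short speech recognition).☆12Jan 23, 2022Updated 4 years ago
- US stock scraper (NASDAQ, AMEX, NYSE)☆16Nov 24, 2016Updated 9 years ago
- BAAI/bge-large-zh could not be cloned from Hugging Face, so it was downloaded manually and placed here for convenience☆11Sep 16, 2023Updated 2 years ago
- Caicai's road to advanced Java: knowledge, notes, and examples on advanced Java topics, including the JVM, concurrent programming, advanced MySQL, common middleware, and more 😆☆12Jun 9, 2025Updated last year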
- A Demonstration Of Vert.x Clustering And Kubernetes Superpowers☆10Oct 13, 2020Updated 5 years ago
- Learning vLLM: deploying the Qwen2-0.5B model with vLLM, with deployment via Docker.☆20Jun 22, 2024Updated 2 years ago
- ControlNet with Txt2Img | Img2Img | + Multiple LoRAs, All in one jupyter notebook for Flux.1 dev. Able to run on Google Colab Free Tier