基于qlora对baichuan-7B大模型进行指令微调。
☆23Jun 22, 2023Updated 2 years ago
Alternatives and similar repositories for baichuan-Qlora-Tuning
Users that are interested in baichuan-Qlora-Tuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Baichuan-13B 指令微调☆90Jul 14, 2023Updated 2 years ago
- 使用指令微调对大模型进行微调。☆11Jun 28, 2023Updated 2 years ago
- Finetune baichuan pretrained model with QLora method☆16Jul 13, 2023Updated 2 years ago
- 使用LoRA对ChatGLM进行微调。☆49Jun 26, 2023Updated 2 years ago
- 以InternLM2-chat-7为基座模型,以常用中药等为数据集,微调的大模型。中医聊天小助手。☆17Feb 29, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 使用qlora对中文大语言模型进行微调,包含ChatGLM、Chinese-LLaMA-Alpaca、BELLE☆89Jun 27, 2023Updated 2 years ago
- 该仓库主要描述了CCAC2023多模态对话情绪识别评测第3名的实现过程☆12Aug 11, 2024Updated last year
- An automated pipeline for scraping, processing, and visualizing medical Q&A data to build high-quality datasets. Includes a comprehensive…☆24Dec 24, 2024Updated last year
- python人脸识别和情绪识别☆17Oct 2, 2023Updated 2 years ago
- SwornDisk是一个面向可信执行环境的、基于日志结构的安全块设备(全国大学生操作系统比赛2022)☆24Aug 14, 2022Updated 3 years ago
- baichuan LLM surpervised finetune by lora☆64Jun 28, 2023Updated 2 years ago
- 该项目专注于识别智能对话场景中的用户文本,自动判断情绪类别并给出相应的准确度。可以广泛应用于社交媒体评论情感分析、智能客服情绪分析等场景,成为情感支持工具,帮助用户 从情绪中解脱。多次Prompt提升后,GPT模型最终识别准确率高于人类Baseline水准。☆11Jul 25, 2023Updated 2 years ago
- 对qwen2.5进行微调以及知识蒸馏☆17Dec 24, 2024Updated last year
- ☆11Apr 20, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Constrained learning using boxes for event-event relation extraction☆12Aug 5, 2022Updated 3 years ago
- 手写一个迷你版本的Tomcat,实现了静态、动态资源的访问。☆10Dec 27, 2020Updated 5 years ago
- A benchmark for assessing the strength of causal relationships between real-world events (EMNLP 2023).☆15Nov 23, 2023Updated 2 years ago
- 使用ACE2005创建以事件和实体为节点的事件知识图谱,用于智能问答☆18Feb 29, 2020Updated 6 years ago
- Convert DeepBind models to Keras☆12Jul 15, 2018Updated 7 years ago
- ☆10Sep 7, 2022Updated 3 years ago
- pytorch bert 版的 multi_label_text_classification☆10Dec 28, 2019Updated 6 years ago
- Python常见工具集合-繁简转换/繁体转换; 词频统计;☆20May 8, 2017Updated 8 years ago
- The project is based on YoloV3 and PyTorch to detect the national flag in the picture.☆11Aug 3, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 用于训练中文DeepSeek R1大模型的Lora脚本☆13Mar 20, 2025Updated last year
- 🚀 轻量视频🎥 大模型🤖☆22Apr 27, 2025Updated 11 months ago
- 中文突发事件语料库(Chinese Emergency Corpus)-上海大学-语义智能实验室☆12Apr 21, 2021Updated 4 years ago
- ☆11May 2, 2023Updated 2 years ago
- 曙光 h5 SDK,隶属于 https://github.com/eventtracing/dawn 项目☆15Mar 15, 2023Updated 3 years ago
- My solutions of the Titanic competition of Kaggle https://www.kaggle.com/c/titanic☆10May 8, 2022Updated 3 years ago
- Therapixel solution of 2017's kaggle challenge on lung cancer detection☆14Feb 9, 2018Updated 8 years ago
- This repo offers advanced tutorials for LLMs, BERT-based models, and multimodal models, covering fine-tuning, quantization, vocabulary ex…☆24May 5, 2025Updated 11 months ago
- [EMNLP'22] Title2Event: Benchmarking Open Event Extraction with a Large-scale Chinese Title Dataset☆20Apr 4, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ICML'25] Official code of paper "Fast Large Language Model Collaborative Decoding via Speculation"☆30Jun 23, 2025Updated 9 months ago
- Combination of Yolov8 and Swin-Transformer☆15Sep 27, 2025Updated 6 months ago
- ☆14Jun 20, 2022Updated 3 years ago
- Used LSTM and Graph Attention Mechanism to detect the causal relationship in a sentence☆12May 14, 2023Updated 2 years ago
- 多模态情绪识别☆30Aug 11, 2023Updated 2 years ago
- Ziya-LLaMA-13B是IDEA基于LLaMa的130亿参数的大规模预训练模型,具备翻译,编程,文本分类,信息抽取,摘要,文案生成,常识问答和数学计算等能力。目前姜子牙通用大模型已完成大规模预训练、多任务有监督微调和人类反馈学习三阶段的训练过程。本文主要用于Ziya-…☆46Jun 9, 2023Updated 2 years ago
- 一个基于大模型微调的中文医疗问答机器人应用☆25Jan 11, 2024Updated 2 years ago