使用qlora对中文大语言模型进行微调,包含ChatGLM、Chinese-LLaMA-Alpaca、BELLE
☆88Jun 27, 2023Updated 2 years ago
Alternatives and similar repositories for qlora-chinese-LLM
Users that are interested in qlora-chinese-LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 使用peft库,对chatGLM-6B/chatGLM2-6B实现4bit的QLoRA高效微调,并做lora model和base model的merge及4bit的量化(quantize)。☆359Aug 22, 2023Updated 2 years ago
- 面向金融领域的小样本跨类迁移事件抽取 第三名 方案及代码☆17Dec 23, 2020Updated 5 years ago
- moss chat finetuning☆51Apr 23, 2024Updated 2 years ago
- 基于qlora对baichuan-7B大模型进行指令微调。☆23Jun 22, 2023Updated 2 years ago
- Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.☆621Jan 24, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Source code of ACL 2023 accepted paper "AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression"☆13Jun 14, 2023Updated 2 years ago
- 中文文本的向量表示方法(Sentence-BERT, CoSENT)的PyTorch简单实现,可以用于文本相似度计算。☆10Mar 27, 2022Updated 4 years ago
- ☆11May 2, 2023Updated 3 years ago
- 2021搜狐校园文本匹配算法大赛☆16Jun 4, 2021Updated 4 years ago
- 基于规则匹配的问答系统中的解析器,the parser of based rule QA system☆12Mar 13, 2020Updated 6 years ago
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调☆3,724Oct 12, 2023Updated 2 years ago
- 基于 LoRA 和 P-Tuning v2 的 ChatGLM-6B 高效参数微调☆55May 17, 2023Updated 3 years ago
- 大语言模型微调的项目,包含了使用QLora微调ChatGLM和LLama☆29Jun 26, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 大语言模型指令调优工具(支持 FlashAttention)☆177Jan 4, 2024Updated 2 years ago
- 本项目采用Keras和ALBERT实现文本多标签分类任务,其中对ALBERT进行微调。☆13Jan 5, 2021Updated 5 years ago
- A large-scale 7B pretraining language model developed by BaiChuan-Inc.☆5,659Jul 18, 2024Updated last year
- BiLLa: A Bilingual LLaMA with Enhanced Reasoning Ability☆417Jun 1, 2023Updated 2 years ago
- DataFountain第五届达观杯第4名方案☆11Dec 3, 2021Updated 4 years ago
- ☆12Nov 14, 2024Updated last year
- chatglm 6b finetuning and alpaca finetuning☆1,535Mar 9, 2025Updated last year
- ☆14May 25, 2023Updated 2 years ago
- 一套代码指令微调大模型☆39Aug 1, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 文本去重☆78May 23, 2024Updated last year
- using lear to do ner extraction☆29Mar 13, 2022Updated 4 years ago
- 国内首个全参数训练的法律大模型 HanFei-1.0 (韩非)☆130Oct 28, 2023Updated 2 years ago
- Source code for ACL 2023 paper Decoder Tuning: Efficient Language Understanding as Decoding☆52Jun 25, 2023Updated 2 years ago
- Deepspeed、LLM、Medical_Dialogue、医疗大模型、预训练、微调☆308May 3, 2026Updated 2 weeks ago
- GTS Engine: A powerful NLU Training System。GTS引擎(GTS-Engine)是一款开箱即用且性能强大的自然语言理解引擎,聚焦于小样本任务,能够仅用小样本就能自动化生产NLP模型。☆93Feb 28, 2023Updated 3 years ago
- ☆10Mar 16, 2023Updated 3 years ago
- 抽取式NLP模型(阅读理解模型,MRC)实现词义消歧(WSD)☆14May 10, 2022Updated 4 years ago
- 复现论文《Distilling Task-Specific Knowledge from BERT into Simple Neural Networks》☆16Jun 13, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- huggingface ChineseBert Tokenizer☆17Apr 16, 2022Updated 4 years ago
- 中文原生工业测评基准☆15Mar 21, 2024Updated 2 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,908Jun 10, 2024Updated last year
- DeepEE: Deep Event Extraction Algorithm Gallery (基于深度学习的开源中文事件抽取算法汇总)☆43Dec 11, 2022Updated 3 years ago
- 基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等☆2,777Dec 12, 2023Updated 2 years ago
- 探索中文instruct数据在ChatGLM, LLaMA上的微调表现☆389Apr 4, 2023Updated 3 years ago
- Train large COMET (T5-3B/GPT2-XL) with small memory (on 11GB memory GPUs like 1080/2080) using DeepSpeed.☆14Jan 23, 2022Updated 4 years ago