taishan1994 / Chinese-LLaMA-Alpaca-LoRA-TuningLinks

使用LoRA对Chinese-LLaMA-Alpaca进行微调。

☆35

Alternatives and similar repositories for Chinese-LLaMA-Alpaca-LoRA-Tuning

Users that are interested in Chinese-LLaMA-Alpaca-LoRA-Tuning are comparing it to the libraries listed below

Sorting:

Pillars-Creation / ChatGLM-RLHF-LoRA-RM-PPO
ChatGLM-6B添加了RLHF的实现，以及部分核心代码的逐行讲解 ,实例部分是做了个新闻短标题的生成，以及指定context推荐的RLHF的实现
☆88Updated 2 years ago
CSHaitao / ChatGLM_mutli_gpu_tuning
deepspeed+trainer简单高效实现多卡微调大模型
☆129Updated 2 years ago
git-cloner / llama2-lora-fine-tuning
llama2 finetuning with deepspeed and lora
☆175Updated 2 years ago
X-jun-0130 / LLM-Pretrain-FineTune
Deepspeed、LLM、Medical_Dialogue、医疗大模型、预训练、微调
☆285Updated last year
Miraclemarvel55 / ChatGLM-RLHF
对ChatGLM直接使用RLHF提升或降低目标输出概率|Modify ChatGLM output with only RLHF
☆196Updated 2 years ago
yuanzhoulvpi2017 / SentenceEmbedding
☆119Updated last year
stanleylsx / llms_tool
一个基于HuggingFace开发的大语言模型训练、测试工具。支持各模型的webui、终端预测，低参数量及全参数模型训练(预训练、SFT、RM、PPO、DPO)和融合、量化。
☆220Updated last year
ssbuild / llm_finetuning
Large language Model fintuning bloom , opt , gpt, gpt2 ,llama,llama-2,cpmant and so on
☆99Updated last year
SupritYoung / RLHF-Label-Tool
用于大模型 RLHF 进行人工数据标注排序的工具。A tool for manual response data annotation sorting in RLHF stage.
☆254Updated 2 years ago
SupritYoung / Zhongjing
A Chinese medical ChatGPT based on LLaMa, training from large-scale pretrain corpus and multi-turn dialogue dataset.
☆380Updated last year
taishan1994 / Llama3.1-Finetuning
对llama3进行全参微调、lora微调以及qlora微调。
☆210Updated last year
mark1879 / Baichuan-13B-Finetuning
Baichuan-13B 指令微调
☆90Updated 2 years ago
michael-wzhu / PromptCBLUE
PromptCBLUE: a large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain in Chinese
☆381Updated last year
jackaduma / ChatGLM-LoRA-RLHF-PyTorch
A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Huma…
☆140Updated 2 years ago
jiangxinyang227 / LLM-tuning
llama，chatglm 等模型的微调
☆91Updated last year
yanqiangmiffy / how-to-train-tokenizer
怎么训练一个LLM分词器
☆154Updated 2 years ago
yongzhuo / Llama2-SFT
Llama2-SFT, Llama-2-7B微调(transformers)/LORA(peft)/推理
☆27Updated 2 years ago
sufengniu / RefGPT
☆164Updated 2 years ago
wptoux / self-instruct-zh
基于ChatGPT构建的中文self-instruct数据集
☆119Updated 2 years ago
yongzhuo / ChatGLM2-SFT
ChatGLM2-6B微调, SFT/LoRA, instruction finetune
☆110Updated 2 years ago
lansinuote / Simple_LLM_DPO
☆76Updated 2 years ago
5663015 / LLMs_train
一套代码指令微调大模型
☆38Updated 2 years ago
zejunwang1 / LLMTuner
大语言模型指令调优工具（支持 FlashAttention）
☆178Updated last year
Dai-shen / LAiW
LAiW: A Chinese Legal Large Language Models Benchmark
☆85Updated last year
WangRongsheng / Chinese-LLaMA-Alpaca-Usage
📔 对Chinese-LLaMA-Alpaca进行使用说明和核心代码注解
☆50Updated 2 years ago
MikeGu721 / EasyLLM
make LLM easier to use
☆59Updated 2 years ago
yangjianxin1 / Firefly-LLaMA2-Chinese
Firefly中文LLaMA-2大模型，支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型
☆413Updated 2 years ago
wp931120 / baichuan_sft_lora
baichuan LLM surpervised finetune by lora
☆64Updated 2 years ago
tjunlp-lab / M3KE
A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark
☆102Updated 2 years ago
muyaostudio / qwen2_seq_cls
使用 Qwen2ForSequenceClassification 简单实现文本分类任务。
☆84Updated last year