ssbuild / chatglm_rlhf

chatglm_rlhf_finetuning

☆30

Alternatives and similar repositories for chatglm_rlhf

Users who are interested in chatglm_rlhf are comparing it to the libraries listed below.

ssbuild / deep_training
deep learning
☆148 Updated 5 months ago
taishan1994 / qlora-chinese-LLM
Fine-tuning Chinese large language models with QLoRA, including ChatGLM, Chinese-LLaMA-Alpaca, and BELLE
☆90 Updated 2 years ago
ssbuild / moss_finetuning
moss chat finetuning
☆51 Updated last year
Miraclemarvel55 / ChatGLM-RLHF
Apply RLHF directly to ChatGLM to raise or lower the probability of target outputs | Modify ChatGLM output with only RLHF
☆195 Updated 2 years ago
yongzhuo / chatglm-maths
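chatglm-6b fine-tuning/LoRA/PPO/inference; samples are automatically generated integer and decimal addition, subtraction, multiplication, and division problems; runs on GPU or CPU
☆164 Updated 2 years ago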
yongzhuo / ChatGLM2-SFT
ChatGLM2-6B fine-tuning, SFT/LoRA, instruction finetune
☆110 Updated 2 years ago
SupritYoung / RLHF-Label-Tool
A tool for manually annotating and ranking response data in the RLHF stage of large-model training.
☆254 Updated 2 years ago
ProjectD-AI / llama_inference
llama inference for tencentpretrain
☆99 Updated 2 years ago
X-PLUG / ChatPLUG
A Chinese Open-Domain Dialogue System
☆324 Updated 2 years ago
LC1332 / Luotuo-QA
Luotuo QA, a Chinese large language model for reading comprehension.
☆75 Updated 2 years ago
vxfla / kanchil
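Kanchil (鼷鹿, the mouse-deer) is the world's smallest even-toed ungulate; this open-source project explores whether small models (under 6B) can also be aligned with human preferences.
☆113 Updated 2 years ago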
llmeval / LLMEval-1
Chinese large language model evaluation, phase 1
☆110 Updated last year
Suffoquer-fang / LuXun-GPT
LLM with LuXun (鲁迅) style
☆86 Updated 2 years ago
AtomEcho / AtomBulb
Aims to provide an intuitive, concrete, and standardized evaluation of current mainstream LLMs
☆94 Updated 2 years ago
Longyichen / Alpaca-family-library
Summarizes all open-source large language models and low-cost replication methods for ChatGPT.
☆137 Updated 2 years ago
sufengniu / RefGPT
☆163 Updated 2 years ago
FlagOpen / FlagInstruct
☆172 Updated 2 years ago
yangjianxin1 / LLMPruner
☆309 Updated 2 years ago
thaumstrial / FinetuneGLMWithPeft
Simple implementation of using LoRA from the peft library to fine-tune chatglm-6b
☆84 Updated 2 years ago
CSHaitao / ChatGLM_mutli_gpu_tuning
Simple and efficient multi-GPU fine-tuning of large models with deepspeed + trainer
☆129 Updated 2 years ago
shibing624 / lmft
ChatGLM-6B fine-tuning.
☆136 Updated 2 years ago
zhangnn520 / chinese_llama_alpaca_lora
Hands-on information extraction with llama
☆100 Updated 2 years ago
georgechen1827 / ChatGLM-text-embedding
Use ChatGLM to perform text embedding
☆45 Updated 2 years ago
sunzeyeah / RLHF
Implementation of Chinese ChatGPT
☆288 Updated last year
OpenLMLab / OpenChineseLLaMA
Chinese large language model base generated through incremental pre-training on Chinese datasets
☆238 Updated 2 years ago
yangzhipeng1108 / DeepSpeed-Chat-ChatGLM
☆44 Updated last year
FreedomIntelligence / InstructionZoo
☆281 Updated last year
LC1332 / CamelBell-Chinese-LoRA
CamelBell (驼铃) is a Chinese language tuning project based on LoRA. CamelBell belongs to Project Luotuo (骆驼), an open-sourced Chinese-…
☆171 Updated last year
zejunwang1 / LLMTuner
Instruction-tuning tool for large language models (supports FlashAttention)
☆178 Updated last year
FudanNLPLAB / CBook-150K
MD5 links for a Chinese book corpus
☆217 Updated last year