SupritYoung/RLHF-Label-Tool

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SupritYoung/RLHF-Label-Tool)

SupritYoung / RLHF-Label-Tool

用于大模型 RLHF 进行人工数据标注排序的工具。A tool for manual response data annotation sorting in RLHF stage.

☆254

Alternatives and similar repositories for RLHF-Label-Tool

Users that are interested in RLHF-Label-Tool are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SupritYoung / Zhongjing
View on GitHub
A Chinese medical ChatGPT based on LLaMa, training from large-scale pretrain corpus and multi-turn dialogue dataset.
☆398Dec 12, 2023Updated 2 years ago
hiyouga / FastEdit
View on GitHub
🩹Editing large language models within 10 seconds⚡
☆1,370Aug 13, 2023Updated 2 years ago
hiyouga / ChatGLM-Efficient-Tuning
View on GitHub
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调
☆3,720Oct 12, 2023Updated 2 years ago
the-seeds / imitater
View on GitHub
Imitate OpenAI with Local Models
☆91Aug 27, 2024Updated last year
LianjiaTech / BELLE
View on GitHub
BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）
☆8,279Oct 16, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
OpenLMLab / MOSS-RLHF
View on GitHub
Secrets of RLHF in Large Language Models Part I: PPO
☆1,426Mar 3, 2024Updated 2 years ago
TigerResearch / TigerBot
View on GitHub
TigerBot: A multi-language multi-task LLM
☆2,259Dec 28, 2024Updated last year
shibing624 / MedicalGPT
View on GitHub
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型，实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。
☆5,667Jun 3, 2026Updated last month
PKU-Alignment / safe-rlhf
View on GitHub
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
☆1,611Nov 24, 2025Updated 8 months ago
Miraclemarvel55 / ChatGLM-RLHF
View on GitHub
对ChatGLM直接使用RLHF提升或降低目标输出概率|Modify ChatGLM output with only RLHF
☆196May 23, 2023Updated 3 years ago
seanzhang-zhichen / baichuan-Dynamic-NTK-ALiBi
View on GitHub
百川Dynamic NTK-ALiBi的代码实现：无需微调即可推理更长文本
☆49Aug 27, 2023Updated 2 years ago
Abbey4799 / CuteGPT
View on GitHub
An open-source conversational language model developed by the Knowledge Works Research Laboratory at Fudan University.
☆64Oct 12, 2023Updated 2 years ago
PhoebusSi / Alpaca-CoT
View on GitHub
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tunin…
☆2,791Dec 12, 2023Updated 2 years ago
xverse-ai / XVERSE-65B
View on GitHub
XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.
☆139Apr 9, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
yanqiangmiffy / InstructGLM
View on GitHub
ChatGLM-6B 指令学习|指令数据|Instruct
☆651Apr 10, 2023Updated 3 years ago
CVI-SZU / Linly
View on GitHub
Chinese-LLaMA 1&2、Chinese-Falcon 基础模型；ChatFlow中文对话模型；中文OpenLLaMA模型；NLP预训练/指令微调数据集
☆3,045Apr 14, 2024Updated 2 years ago
yangjianxin1 / Firefly
View on GitHub
Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…
☆6,649Oct 24, 2024Updated last year
DachengLi1 / LongChat
View on GitHub
Official repository for LongChat and LongEval
☆536May 24, 2024Updated 2 years ago
yangjianxin1 / Firefly-LLaMA2-Chinese
View on GitHub
Firefly中文LLaMA-2大模型，支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型
☆415Oct 21, 2023Updated 2 years ago
hkust-nlp / ceval
View on GitHub
Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
☆1,862Jul 27, 2025Updated last year
ssbuild / chatglm_rlhf
View on GitHub
chatglm_rlhf_finetuning
☆30Oct 10, 2023Updated 2 years ago
HarderThenHarder / transformers_tasks
View on GitHub
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SF…
☆2,421Sep 29, 2023Updated 2 years ago
liucann / CPMI-ChatGLM
View on GitHub
☆10Mar 18, 2024Updated 2 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
WangRongsheng / CareGPT
View on GitHub
🌞 CareGPT (关怀GPT)是一个医疗大语言模型，同时它集合了数十个公开可用的医疗微调数据集和开放可用的医疗大语言模型，包含LLM的训练、测评、部署等以促进医疗LLM快速发展。Medical LLM, Open Source Driven for a Healthy…
☆997May 9, 2024Updated 2 years ago
X-jun-0130 / LLM-Pretrain-FineTune
View on GitHub
Deepspeed、LLM、Medical_Dialogue、医疗大模型、预训练、微调
☆314May 3, 2026Updated 2 months ago
KwaiKEG / KwaiAgents
View on GitHub
A generalized information-seeking agent system with Large Language Models (LLMs).
☆1,203Jun 19, 2024Updated 2 years ago
beyondguo / LLM-Tuning
View on GitHub
Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.
☆1,015Apr 27, 2024Updated 2 years ago
xverse-ai / XVERSE-13B
View on GitHub
XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.
☆641Apr 9, 2024Updated 2 years ago
esbatmop / MNBVC
View on GitHub
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化，也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志…
☆4,244Jul 13, 2026Updated 2 weeks ago
Neutralzz / BiLLa
View on GitHub
BiLLa: A Bilingual LLaMA with Enhanced Reasoning Ability
☆415Jun 1, 2023Updated 3 years ago
GanjinZero / RRHF
View on GitHub
[NIPS2023] RRHF & Wombat
☆805Sep 22, 2023Updated 2 years ago
liucongg / ChatGLM-Finetuning
View on GitHub
基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型，进行下游具体任务微调，涉及Freeze、Lora、P-tuning、全参微调等
☆2,773Dec 12, 2023Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Instruction-Tuning-with-GPT-4 / GPT-4-LLM
View on GitHub
Instruction Tuning with GPT-4
☆4,333Jun 11, 2023Updated 3 years ago
BAAI-Zlab / COIG
View on GitHub
☆128May 27, 2023Updated 3 years ago
AetherCortex / Llama-X
View on GitHub
Open Academic Research on Improving LLaMA to SOTA LLM
☆1,605Aug 30, 2023Updated 2 years ago
carbonz0 / alpaca-chinese-dataset
View on GitHub
alpaca中文指令微调数据集
☆395Mar 26, 2023Updated 3 years ago
threeColorFr / LLMforDialogDataGenerate
View on GitHub
Generate dialog data from documents using LLM like ChatGLM2 or ChatGPT;利用ChatGLM2,ChatGPT等大模型根据文档生成对话数据集
☆162Oct 25, 2023Updated 2 years ago
dandelionsllm / pandallm
View on GitHub
Panda项目是于2023年5月启动的开源海外中文大语言模型项目，致力于大模型时代探索整个技术栈，旨在推动中文自然语言处理领域的创新和合作。
☆1,032Oct 19, 2023Updated 2 years ago
sufengniu / RefGPT
View on GitHub
☆164Apr 17, 2023Updated 3 years ago