thu-coai / BPOLinks

☆328

Alternatives and similar repositories for BPO

Users that are interested in BPO are comparing it to the libraries listed below

Sorting:

tianyi-lab / Cherry_LLM
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…
☆397Updated 3 months ago
X-PLUG / Multi-LLM-Agent
☆232Updated last year
yangjianxin1 / LongQLoRA
LongQLoRA: Extent Context Length of LLMs Efficiently
☆166Updated last year
morecry / CharacterEval
☆269Updated 4 months ago
sufengniu / RefGPT
☆163Updated 2 years ago
thu-coai / CritiqueLLM
☆147Updated last year
CLUEbenchmark / SuperCLUE-Agent
SuperCLUE-Agent: 基于中文原生任务的Agent智能体核心能力测评基准
☆92Updated last year
THUDM / AlignBench
大模型多维度中文对齐评测基准 (ACL 2024)
☆414Updated last year
BAAI-Zlab / COIG
☆128Updated 2 years ago
OFA-Sys / InsTag
InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning
☆276Updated 2 years ago
open-compass / BotChat
Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.
☆158Updated 5 months ago
zejunwang1 / LLMTuner
大语言模型指令调优工具（支持 FlashAttention）
☆178Updated last year
SupritYoung / RLHF-Label-Tool
用于大模型 RLHF 进行人工数据标注排序的工具。A tool for manual response data annotation sorting in RLHF stage.
☆254Updated 2 years ago
DA-southampton / RedGPT
☆68Updated 2 years ago
bojone / NBCE
Naive Bayes-based Context Extension
☆324Updated 10 months ago
CASIA-LM / ChineseWebText
☆179Updated last year
zjunlp / EasyInstruct
[ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.
☆405Updated 9 months ago
YJiangcm / Lion
[EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models
☆211Updated last year
FreedomIntelligence / InstructionZoo
☆281Updated last year
InteractiveNLP-Team / RoleLLM-public
RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models
☆506Updated last year
hkust-nlp / deita
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
☆571Updated 10 months ago
flageval-baai / FlagEval
FlagEval is an evaluation toolkit for AI large foundation models.
☆338Updated 5 months ago
GAIR-NLP / auto-j
Generative Judge for Evaluating Alignment
☆247Updated last year
twang2218 / vocab-coverage
语言模型中文认知能力分析
☆236Updated 2 years ago
CASIA-LM / MoDS
☆145Updated last year
GAIR-NLP / abel
SOTA Math Opensource LLM
☆333Updated last year
git-cloner / llama2-lora-fine-tuning
llama2 finetuning with deepspeed and lora
☆176Updated 2 years ago
MikeGu721 / XiezhiBenchmark
☆97Updated last year
the-seeds / imitater
Imitate OpenAI with Local Models
☆88Updated last year
OpenMOSS / HalluQA
Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"
☆135Updated last year