BinFuPKU / LLM-AlignmentLinks

A Survey of LLM Alignment (SFT & RLHF), and A Survey of RLHF methods (2023~2024)

☆18

Alternatives and similar repositories for LLM-Alignment

Users that are interested in LLM-Alignment are comparing it to the libraries listed below

Sorting:

CASIA-LM / MoDS
☆141Updated last year
DA-southampton / RedGPT
☆63Updated 2 years ago
muyaostudio / qwen2_seq_cls
使用 Qwen2ForSequenceClassification 简单实现文本分类任务。
☆67Updated last year
tjunlp-lab / M3KE
A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark
☆102Updated last year
mutonix / RefGPT
☆97Updated last year
sufengniu / RefGPT
☆162Updated 2 years ago
IronBeliever / CaR
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation
☆81Updated 7 months ago
CSHaitao / ChatGLM_mutli_gpu_tuning
deepspeed+trainer简单高效实现多卡微调大模型
☆126Updated 2 years ago
OpenMOSS / HalluQA
Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"
☆131Updated last year
yuanzhoulvpi2017 / SentenceEmbedding
☆111Updated 11 months ago
THUIR / T2Ranking
T2Ranking: A large-scale Chinese benchmark for passage ranking.
☆159Updated last year
suu990901 / LLaMA-MiLe-Loss
Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models
☆63Updated 4 months ago
sugarandgugu / Simple-Trl-Training
基于DPO算法微调语言大模型，简单好上手。
☆39Updated 11 months ago
lansinuote / Simple_LLM_DPO
☆69Updated last year
pldlgb / nuggets
☆82Updated last year
mark1879 / Baichuan-13B-Finetuning
Baichuan-13B 指令微调
☆90Updated last year
zjunlp / IEPile
[ACL 2024] IEPile: A Large-Scale Information Extraction Corpus
☆196Updated 5 months ago
tianyi-lab / Cherry_LLM
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…
☆373Updated 9 months ago
zejunwang1 / LLMTuner
大语言模型指令调优工具（支持 FlashAttention）
☆173Updated last year
zhangzhao219 / WSDM-Cup-2024
1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc
☆160Updated last year
PolarisRisingWar / Math_Word_Problem_Collection
A collection for math word problem (MWP) works, including datasets, algorithms and so on.
☆43Updated last year
MikeGu721 / XiezhiBenchmark
☆97Updated last year
yangjianxin1 / SimCSE
SimCSE有监督与无监督实验复现
☆148Updated last year
CASIA-LM / ChineseWebText
☆169Updated last year
yanqiangmiffy / how-to-train-tokenizer
怎么训练一个LLM分词器
☆150Updated last year
pengr / LLM-Synthetic-Data
Real-time updated, fine-grained reading list on LLM-synthetic-data.🔥
☆262Updated 5 months ago
Alibaba-NLP / Multi-CPR
[SIGIR 2022] Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval
☆188Updated 2 years ago
zhpmatrix / PaperReading
每天阅读过的论文的简要笔记
☆209Updated 5 months ago
llmeval / llmeval-2
中文大语言模型评测第二期
☆70Updated last year
dawoshi / Tianchi-LLM-QA
阿里天池: 2023全球智能汽车AI挑战赛——赛道一：AI大模型检索问答 baseline 80+
☆105Updated last year