BinFuPKU / LLM-Alignment
A Survey of LLM Alignment (SFT & RLHF), and A Survey of RLHF methods (2023~2024)
☆16Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for LLM-Alignment
- ☆88Updated 4 months ago
- ☆62Updated last year
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆67Updated last week
- ☆120Updated 7 months ago
- A collection for math word problem (MWP) works, including datasets, algorithms and so on.☆36Updated 5 months ago
- ☆71Updated 10 months ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆218Updated last year
- Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation☆37Updated 5 months ago
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆87Updated 7 months ago
- [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus☆174Updated this week
- ☆34Updated 2 months ago
- 基于DPO算法微调语言大模型,简单好上手。☆28Updated 4 months ago
- code for piccolo embedding model from SenseTime☆111Updated 6 months ago
- ☆93Updated 8 months ago
- T2Ranking: A large-scale Chinese benchmark for passage ranking.☆151Updated last year
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆109Updated 5 months ago
- 使用单个24G显卡,从0开始训练LLM☆49Updated 3 weeks ago
- Baichuan-13B 指令微调☆89Updated last year
- ☆130Updated 8 months ago
- ☆53Updated 4 months ago
- ☆91Updated 11 months ago
- [SIGIR'24] The official implementation code of MOELoRA.☆127Updated 3 months ago
- deepspeed+trainer简单高效实现多卡微调大模型☆116Updated last year
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆308Updated 2 months ago
- ☆157Updated last year
- 中文大语言模型评测第二期☆70Updated last year
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆155Updated 8 months ago
- 大语言模型指令调优工具(支持 FlashAttention)☆166Updated 10 months ago
- 中文原生检索增强生成测评基准☆100Updated 7 months ago
- [ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Gener…☆58Updated 4 months ago