BinFuPKU / LLM-AlignmentLinks
A Survey of LLM Alignment (SFT & RLHF), and A Survey of RLHF methods (2023~2024)
☆21Updated last year
Alternatives and similar repositories for LLM-Alignment
Users that are interested in LLM-Alignment are comparing it to the libraries listed below
Sorting:
- ☆114Updated last year
- 使用 Qwen2ForSequenceClassification 简单实现文本分类任务。☆78Updated last year
- deepspeed+trainer简单高效实现多卡微调大模 型☆128Updated 2 years ago
- basic framework for rag(retrieval augment generation)☆86Updated last year
- Universal information extraction with instruction learning☆391Updated 6 months ago
- ☆73Updated last year
- ☆67Updated 2 years ago
- 怎么训练一个LLM分词器☆152Updated 2 years ago
- 基于ChatGPT构建的中文self-instruct数据集☆118Updated 2 years ago
- ☆162Updated 2 years ago
- 大语言模型指令调优工具(支持 FlashAttention)☆178Updated last year
- 每天阅读过的论文的简要笔记☆210Updated last week
- ☆145Updated last year
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆338Updated last year
- 中文 Instruction tuning datasets☆134Updated last year
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆385Updated 2 months ago
- 《ChatGPT原理与实战:大型语言模型的算法、技术和私有化》☆364Updated last year
- Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"☆82Updated last year
- an intro to retrieval augmented large language model☆300Updated last year
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆161Updated last month
- 开源SFT数据集整理,随时补充☆539Updated 2 years ago
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark☆102Updated 2 years ago
- 该仓库主要记录 LLMs 算法工程师相关的顶会论文研读笔记(多模态、PEFT、小样本QA问答、RAG、LMMs可解释性、Agents、CoT)☆351Updated last year
- Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models☆65Updated 6 months ago
- llama,chatglm 等模型的微调☆90Updated last year
- 语言模型中文认知能力分析☆237Updated last year
- baichuan LLM surpervised finetune by lora☆64Updated 2 years ago
- ChatGLM-6B添加了RLHF的实现,以及部分核心代码的逐行讲解 ,实例部分是做了个新闻短标题的生成,以及指定context推荐的RLHF的实现☆87Updated 2 years ago
- T2Ranking: A large-scale Chinese benchmark for passage ranking.☆160Updated 2 years ago
- Baichuan-13B 指令微调☆91Updated 2 years ago