BinFuPKU / LLM-AlignmentLinks
A Survey of LLM Alignment (SFT & RLHF), and A Survey of RLHF methods (2023~2024)
☆21Updated last year
Alternatives and similar repositories for LLM-Alignment
Users that are interested in LLM-Alignment are comparing it to the libraries listed below
Sorting:
- ☆119Updated last year
- Universal information extraction with instruction learning☆393Updated 10 months ago
- 怎么训练一个LLM分词器☆154Updated 2 years ago
- ☆164Updated 2 years ago
- 《ChatGPT原理与实战:大型语言模型的算法、技术和私有化》☆369Updated 2 years ago
- basic framework for rag(retrieval augment generation)☆86Updated 2 years ago
- 使用 Qwen2ForSequenceClassification 简单实现文本分类任务。☆89Updated last year
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆162Updated 5 months ago
- 本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料,该资料目前包含 自然语言处理各领域的 面试题积累。☆107Updated 4 years ago
- PromptCBLUE: a large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain in Chinese☆386Updated last year
- ☆77Updated 2 years ago
- ☆69Updated 2 years ago
- [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus☆209Updated last year
- 雅意信息抽取大模型:在百万级人工构造的高质量信息抽取数据上进行指令微调,由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)☆314Updated last year
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark☆103Updated 2 years ago
- 语言模型中文认知能力分析☆236Updated 2 years ago
- 开源SFT数据集整理,随时补充☆566Updated 2 years ago
- ☆147Updated last year
- A curated list of research papers in Sentence Reprsentation Learning and a sts leaderboard of sentence embeddings.☆314Updated 2 years ago
- 该仓库主要记录 LLMs 算法工程师相关的顶会论文研读笔记(多模态、PEFT、小样本QA问答、RAG、LMMs可解释性、Agents、CoT)☆369Updated last year
- an intro to retrieval augmented large language model☆304Updated 2 years ago
- ☆213Updated last year
- 大模型多维度中文对齐评测基准 (ACL 2024)☆422Updated 2 months ago
- 大语言模型指令调优工具(支持 FlashAttention)☆178Updated 2 years ago
- A live reading list for LLM data synthesis (Updated to July, 2025).☆434Updated 4 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆412Updated 6 months ago
- 活字通用大模型☆390Updated last year
- llama,chatglm 等模型的微调☆91Updated last year
- LAiW: A Chinese Legal Large Language Models Benchmark☆86Updated last year
- 面向中文大模型价值观的评估与对齐研究☆549Updated 2 years ago