Magnetic2014 / llm-alignment-survey
A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey" for more details!
โ65Updated 11 months ago
Related projects: โ
- Feeling confused about super alignment? Here is a reading listโ42Updated 8 months ago
- ๐ An unofficial implementation of Self-Alignment with Instruction Backtranslation.โ128Updated 2 months ago
- Achieving Efficient Alignment through Learned Correctionโ103Updated 3 months ago
- The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"โ62Updated 3 weeks ago
- [SIGIR'24] The official implementation code of MOELoRA.โ113Updated last month
- AI Alignment: A Comprehensive Surveyโ123Updated 10 months ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied witโฆโ60Updated 2 months ago
- โ71Updated 8 months ago
- A reading list on LLM based Synthetic Data Generation ๐ฅโ100Updated last month
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning.โ119Updated last year
- Repository for the paper "Cognitive Mirage: A Review of Hallucinations in Large Language Models"โ47Updated 10 months ago
- Awesome papers for role-playing with language modelsโ88Updated last month
- Paper list and datasets for the paper: A Survey on Data Selection for LLM Instruction Tuningโ27Updated 7 months ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?โ62Updated 7 months ago
- BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).โ99Updated 10 months ago
- Do Large Language Models Know What They Donโt Know?โ84Updated 9 months ago
- ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenariosโ63Updated 5 months ago
- ๐งฌ RegMix: Data Mixture as Regression for Language Model Pre-trainingโ78Updated last month
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. ๐งฎโจโ72Updated 4 months ago
- [ICML 2024] Selecting High-Quality Data for Training Language Modelsโ134Updated 2 months ago
- Controllable Text Generation for Large Language Models: A Surveyโ88Updated 3 weeks ago
- โ46Updated 2 months ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learningโ144Updated 7 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".โ81Updated 3 weeks ago
- Official completion of โTraining on the Benchmark Is Not All You Needโ.โ18Updated this week
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..โ108Updated last week
- โ124Updated 2 months ago
- โ82Updated 5 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuningโ101Updated last week
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey`โ134Updated 6 months ago