Magnetic2014 / llm-alignment-survey
A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey" for more details!
โ72Updated last year
Alternatives and similar repositories for llm-alignment-survey:
Users that are interested in llm-alignment-survey are comparing it to the libraries listed below
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"โ98Updated 3 months ago
- ๐ An unofficial implementation of Self-Alignment with Instruction Backtranslation.โ136Updated 6 months ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correctโ138Updated last month
- Paper list and datasets for the paper: A Survey on Data Selection for LLM Instruction Tuningโ38Updated 11 months ago
- Collection of papers for scalable automated alignment.โ82Updated 2 months ago
- BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).โ118Updated last year
- โ78Updated last year
- Repository for the paper "Cognitive Mirage: A Review of Hallucinations in Large Language Models"โ47Updated last year
- Feeling confused about super alignment? Here is a reading listโ42Updated last year
- AI Alignment: A Comprehensive Surveyโ133Updated last year
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied witโฆโ102Updated 6 months ago
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning.โ128Updated last year
- โ60Updated 6 months ago
- โ38Updated last year
- [ICML 2024] Selecting High-Quality Data for Training Language Modelsโ154Updated 6 months ago
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feeโฆโ37Updated 5 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".โ107Updated 2 months ago
- โ136Updated 6 months ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. ๐งฎโจโ145Updated 8 months ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learningโ160Updated 11 months ago
- Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Modelsโ85Updated 5 months ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".โ62Updated this week
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuningโ137Updated 4 months ago
- Counting-Stars (โ )โ78Updated 4 months ago
- Fantastic Data Engineering for Large Language Modelsโ64Updated 2 weeks ago
- Do Large Language Models Know What They Donโt Know?โ88Updated 2 months ago
- [SIGIR'24] The official implementation code of MOELoRA.โ142Updated 5 months ago
- โ48Updated 10 months ago
- Code implementation of synthetic continued pretrainingโ79Updated last week
- Reference implementation for Token-level Direct Preference Optimization(TDPO)โ124Updated 6 months ago