Magnetic2014 / llm-alignment-survey
A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey" for more details!
β77Updated last year
Alternatives and similar repositories for llm-alignment-survey:
Users that are interested in llm-alignment-survey are comparing it to the libraries listed below
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"β110Updated 6 months ago
- Collection of papers for scalable automated alignment.β86Updated 5 months ago
- π An unofficial implementation of Self-Alignment with Instruction Backtranslation.β137Updated 9 months ago
- β80Updated last year
- β54Updated 5 months ago
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feeβ¦β38Updated 8 months ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learningβ161Updated last year
- [ICML'2024] Can AI Assistants Know What They Don't Know?β78Updated last year
- Feeling confused about super alignment? Here is a reading listβ42Updated last year
- A Comprehensive Survey on Long Context Language Modelingβ86Updated last week
- Reference implementation for Token-level Direct Preference Optimization(TDPO)β130Updated last month
- β113Updated 2 months ago
- β41Updated last year
- AI Alignment: A Comprehensive Surveyβ133Updated last year
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied witβ¦β119Updated 8 months ago
- Repository for the paper "Cognitive Mirage: A Review of Hallucinations in Large Language Models"β47Updated last year
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)β74Updated last month
- β33Updated last month
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".β116Updated 4 months ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".β73Updated 2 months ago
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.β42Updated 9 months ago
- [SIGIR'24] The official implementation code of MOELoRA.β153Updated 8 months ago
- β166Updated last month
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questionsβ107Updated 6 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".β52Updated 3 months ago
- Fantastic Data Engineering for Large Language Modelsβ84Updated 2 months ago
- [ICML 2024] Selecting High-Quality Data for Training Language Modelsβ158Updated 9 months ago
- A research repo for experiments about Reinforcement Finetuningβ36Updated last week
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Modelsβ251Updated 6 months ago
- β49Updated last year