sail-sg / sdft
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
☆96 Updated last week
Related projects
Alternatives and complementary repositories for sdft
- ☆70 Updated 10 months ago
- [SIGIR'24] The official implementation code of MOELoRA. ☆123 Updated 3 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning ☆123 Updated 2 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment" ☆67 Updated 5 months ago
- Paper list and datasets for the paper: A Survey on Data Selection for LLM Instruction Tuning ☆32 Updated 9 months ago
- ☆115 Updated 3 months ago
- Fantastic Data Engineering for Large Language Models ☆49 Updated 3 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey" ☆81 Updated last month
- LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models ☆66 Updated 3 weeks ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct ☆111 Updated last week
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't… ☆84 Updated 3 months ago
- UniGen: A Unified Framework for Dataset Generation via Large Language Model ☆28 Updated last month
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee… ☆32 Updated 3 months ago
- ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios ☆62 Updated 6 months ago
- ☆89 Updated last month
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L… ☆37 Updated 4 months ago
- Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". ☆36 Updated this week
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs. ☆43 Updated last week
- [ICML 2024] Selecting High-Quality Data for Training Language Models ☆141 Updated 4 months ago
- This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models" ☆43 Updated last week
- [ICML'2024] Can AI Assistants Know What They Don't Know? ☆70 Updated 9 months ago
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"… ☆71 Updated last year
- ☆48 Updated 8 months ago
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs. ☆38 Updated 4 months ago
- ☆53 Updated 2 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs. ☆46 Updated last month
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨ ☆94 Updated 6 months ago
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models ☆72 Updated 7 months ago
- Feeling confused about super alignment? Here is a reading list ☆43 Updated 10 months ago
- ☆37 Updated 5 months ago