MLGroup-JLU / LLM-data-aug-survey
The official GitHub page for the survey paper "A Survey on Data Augmentation in Large Model Era"
☆108Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for LLM-data-aug-survey
- ☆119Updated 9 months ago
- A Toolkit for Table-based Question Answering☆105Updated last year
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"…☆71Updated last year
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆38Updated 4 months ago
- ☆40Updated 5 months ago
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆191Updated last month
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆125Updated 2 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆100Updated 2 weeks ago
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey`☆146Updated last month
- ☆120Updated 7 months ago
- Awesome papers for role-playing with language models☆122Updated 2 weeks ago
- [SIGIR'24] The official implementation code of MOELoRA.☆124Updated 3 months ago
- LLaMA Factory Document☆73Updated last month
- 代码大模型 预训练&微调&DPO 数据处理 业界处理pipeline sota☆26Updated 3 months ago
- LongQLoRA: Extent Context Length of LLMs Efficiently☆159Updated last year
- ☆94Updated 6 months ago
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆155Updated 8 months ago
- 本项目用于大模型数学解题能力方面的数据集合成,模型训练及评测,相关文章记录。☆54Updated 2 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆86Updated 2 months ago
- Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed)☆69Updated 9 months ago
- Paper list and datasets for the paper: A Survey on Data Selection for LLM Instruction Tuning☆33Updated 9 months ago
- The related works and background techniques about Openai o1☆142Updated last week
- ☆71Updated 10 months ago
- Continual Learning of Large Language Models: A Comprehensive Survey☆252Updated last week
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆220Updated 3 weeks ago
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆67Updated last week
- 顾名思义:手搓的RAG☆111Updated 8 months ago
- ☆53Updated 4 months ago
- ☆129Updated 4 months ago
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"☆130Updated 3 months ago