MLGroup-JLU / LLM-data-aug-surveyLinks
The official GitHub page for the survey paper "A Survey on Data Augmentation in Large Model Era"
☆128Updated last year
Alternatives and similar repositories for LLM-data-aug-survey
Users that are interested in LLM-data-aug-survey are comparing it to the libraries listed below
Sorting:
- ☆124Updated last year
- [ACM Computing Surveys 2025] This repository collects awesome survey, resource, and paper for Lifelong Learning with Large Language Model…☆141Updated 2 months ago
- A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"☆105Updated 3 weeks ago
- Awesome-Large-Search-Models is a collection of papers and resources (Methods, Datasets and other resources) about awesome agentic search …☆111Updated last month
- This repo aims to record resource of role-playing abilities in LLMs, including dataset, paper, application, etc.☆128Updated 10 months ago
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"…☆79Updated last year
- Fantastic Data Engineering for Large Language Models☆89Updated 7 months ago
- A Toolkit for Table-based Question Answering☆112Updated last year
- ☆100Updated last year
- A live reading list for LLM-synthetic-data.☆343Updated last week
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆125Updated 9 months ago
- Collect every awesome work about r1!☆400Updated 3 months ago
- [SIGIR'24] The official implementation code of MOELoRA.☆174Updated last year
- Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning☆83Updated last year
- Counting-Stars (★)☆83Updated 2 months ago
- ☆53Updated 10 months ago
- This project aims to collect and collate various datasets for multimodal large model training, including but not limited to pre-training …☆53Updated 3 months ago
- ☆49Updated last year
- ☆144Updated last year
- ☆70Updated 6 months ago
- AI Alignment: A Comprehensive Survey☆135Updated last year
- ☆83Updated last year
- Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"☆82Updated 11 months ago
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | 继续预训练提升 …☆34Updated 2 months ago
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models☆112Updated last month
- Neural Code Intelligence Survey 2024; Reading lists and resources☆265Updated last week
- ☆106Updated 7 months ago
- Survey of Small Language Models from Penn State, ...☆187Updated 2 weeks ago
- Official completion of “Training on the Benchmark Is Not All You Need”.☆35Updated 7 months ago
- A curated list of awesome works in Routing LLMs paradigm (👉 Welcome to submit your contributions to this code repository)☆50Updated 3 weeks ago