MLGroup-JLU / LLM-data-aug-surveyLinks

The official GitHub page for the survey paper "A Survey on Data Augmentation in Large Model Era"

☆128

Alternatives and similar repositories for LLM-data-aug-survey

Users that are interested in LLM-data-aug-survey are comparing it to the libraries listed below

Sorting:

cavalierlulu / rag_survey
☆124Updated last year
zzz47zzz / awesome-lifelong-learning-methods-for-llm
[ACM Computing Surveys 2025] This repository collects awesome survey, resource, and paper for Lifelong Learning with Large Language Model…
☆141Updated 2 months ago
junchenzhi / Awesome-LLM-Ensemble
A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"
☆105Updated 3 weeks ago
Wu-Zongyu / Awesome-Large-Search-Models
Awesome-Large-Search-Models is a collection of papers and resources (Methods, Datasets and other resources) about awesome agentic search …
☆111Updated last month
HqWu-HITCS / Awesome-Personalized-LLM
This repo aims to record resource of role-playing abilities in LLMs, including dataset, paper, application, etc.
☆128Updated 10 months ago
Magnetic2014 / llm-alignment-survey
A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"…
☆79Updated last year
yuleiqin / fantastic-data-engineering
Fantastic Data Engineering for Large Language Models
☆89Updated 7 months ago
lfy79001 / TableQAKit
A Toolkit for Table-based Question Answering
☆112Updated last year
LightChen233 / Awesome-LLM-for-NLP
☆100Updated last year
pengr / LLM-Synthetic-Data
A live reading list for LLM-synthetic-data.
☆343Updated last week
sail-sg / sdft
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
☆125Updated 9 months ago
modelscope / awesome-deep-reasoning
Collect every awesome work about r1!
☆400Updated 3 months ago
liuqidong07 / MOELoRA-peft
[SIGIR'24] The official implementation code of MOELoRA.
☆174Updated last year
OFA-Sys / DiverseEvol
Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning
☆83Updated last year
nick7nlp / Counting-Stars
Counting-Stars (★)
☆83Updated 2 months ago
PKU-Baichuan-MLSystemLab / PAS
☆53Updated 10 months ago
jinbo0906 / Awesome-MLLM-Datasets
This project aims to collect and collate various datasets for multimodal large model training, including but not limited to pre-training …
☆53Updated 3 months ago
FlagOpen / Infinity-Instruct
☆49Updated last year
thu-coai / CritiqueLLM
☆144Updated last year
OpenStellarTeam / ChineseSimpleQA
☆70Updated 6 months ago
PKU-Alignment / AlignmentSurvey
AI Alignment: A Comprehensive Survey
☆135Updated last year
pldlgb / nuggets
☆83Updated last year
RUC-GSAI / Yulan-GARDEN
Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"
☆82Updated 11 months ago
RUC-GSAI / Llama-3-SynE
Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | 继续预训练提升 …
☆34Updated 2 months ago
DevoAllen / Awesome-Reasoning-Economy-Papers
Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models
☆112Updated last month
QiushiSun / Awesome-Code-Intelligence
Neural Code Intelligence Survey 2024; Reading lists and resources
☆265Updated last week
LightChen233 / Awesome-Multilingual-LLM
☆106Updated 7 months ago
FairyFali / SLMs-Survey
Survey of Small Language Models from Penn State, ...
☆187Updated 2 weeks ago
nishiwen1214 / Benchmark-leakage-detection
Official completion of “Training on the Benchmark Is Not All You Need”.
☆35Updated 7 months ago
MilkThink-Lab / Awesome-Routing-LLMs
A curated list of awesome works in Routing LLMs paradigm (👉 Welcome to submit your contributions to this code repository)
☆50Updated 3 weeks ago