MLGroup-JLU / LLM-data-aug-surveyLinks
The official GitHub page for the survey paper "A Survey on Data Augmentation in Large Model Era"
☆128Updated last year
Alternatives and similar repositories for LLM-data-aug-survey
Users that are interested in LLM-data-aug-survey are comparing it to the libraries listed below
Sorting:
- ☆125Updated last year
- This repo aims to record resource of role-playing abilities in LLMs, including dataset, paper, application, etc.☆128Updated 11 months ago
- Awesome-Large-Search-Models is a collection of papers and resources (Methods, Datasets and other resources) about awesome agentic search …☆120Updated 2 weeks ago
- [ACM Computing Surveys 2025] This repository collects awesome survey, resource, and paper for Lifelong Learning with Large Language Model…☆146Updated 3 months ago
- A Toolkit for Table-based Question Answering☆112Updated last year
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"…☆80Updated last year
- ☆100Updated last year
- A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"☆118Updated last week
- Fantastic Data Engineering for Large Language Models☆90Updated 8 months ago
- ☆54Updated last year
- ☆73Updated 7 months ago
- Collect every awesome work about r1!☆416Updated 4 months ago
- [SIGIR'24] The official implementation code of MOELoRA.☆178Updated last year
- 珠算代码大模型(Abacus Code LLM)☆56Updated 11 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆129Updated 10 months ago
- A live reading list for LLM data synthesis (Updated to July, 2025).☆374Updated 3 weeks ago
- Counting-Stars (★)☆83Updated 3 months ago
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey`☆186Updated last month
- ☆147Updated last year
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆247Updated 10 months ago
- This project aims to collect and collate various datasets for multimodal large model training, including but not limited to pre-training …☆57Updated 4 months ago
- Token level visualization tools for large language models☆88Updated 8 months ago
- ☆67Updated 7 months ago
- 顾名思义:手搓的RAG☆127Updated last year
- ☆49Updated last year
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆272Updated 2 years ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆58Updated 11 months ago
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token☆152Updated last year
- Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning☆84Updated last year
- Neural Code Intelligence Survey 2024; Reading lists and resources☆268Updated last month