MLGroup-JLU / LLM-data-aug-survey
The official GitHub page for the survey paper "A Survey on Data Augmentation in Large Model Era"
☆129Updated last year
Alternatives and similar repositories for LLM-data-aug-survey
Users that are interested in LLM-data-aug-survey are comparing it to the libraries listed below
- ☆125Updated last year
- This repo collects resources on role-playing abilities in LLMs, including datasets, papers, applications, etc.☆130Updated last year
- [ACM Computing Surveys 2025] This repository collects awesome surveys, resources, and papers for Lifelong Learning with Large Language Model…☆150Updated 4 months ago
- [SIGIR'24] The official implementation code of MOELoRA.☆184Updated last year
- A Toolkit for Table-based Question Answering☆114Updated 2 years ago
- ☆102Updated last year
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"…☆81Updated 2 years ago
- A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"☆150Updated last week
- Collect every awesome work about r1!☆420Updated 5 months ago
- Fantastic Data Engineering for Large Language Models☆91Updated 9 months ago
- This project aims to collect and collate various datasets for multimodal large model training, including but not limited to pre-training …☆58Updated 5 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆131Updated 11 months ago
- ☆74Updated 9 months ago
- ☆54Updated last year
- ☆169Updated 5 months ago
- Abacus Code LLM (珠算代码大模型)☆56Updated last year
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models☆119Updated last week
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆180Updated 4 months ago
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey`☆190Updated 2 months ago
- The code and data of DPA-RAG, accepted by WWW 2025 main conference.☆63Updated this week
- This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts".☆40Updated last month
- Neural Code Intelligence Survey 2024; Reading lists and resources☆275Updated 3 months ago
- ☆116Updated last year
- Our code for ICLR'25 paper "DataMan: Data Manager for Pre-training Large Language Models".☆106Updated 2 months ago
- ☆49Updated last year
- Awesome LLM pre-training resources, including data, frameworks, and methods.☆269Updated 5 months ago
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆251Updated 11 months ago
- Token level visualization tools for large language models☆89Updated 9 months ago
- ☆67Updated 8 months ago
- Official GitHub repo for AutoDetect, an automated weakness detection framework for LLMs.☆44Updated last year