MLGroup-JLU / LLM-data-aug-surveyLinks
The official GitHub page for the survey paper "A Survey on Data Augmentation in Large Model Era"
β126Updated 11 months ago
Alternatives and similar repositories for LLM-data-aug-survey
Users that are interested in LLM-data-aug-survey are comparing it to the libraries listed below
Sorting:
- β124Updated last year
- Real-time updated, fine-grained reading list on LLM-synthetic-data.π₯β262Updated 5 months ago
- This repo aims to record resource of role-playing abilities in LLMs, including dataset, paper, application, etc.β123Updated 9 months ago
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"β¦β81Updated last year
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".β122Updated 7 months ago
- A Comprehensive Survey on Long Context Language Modelingβ152Updated 3 weeks ago
- A Toolkit for Table-based Question Answeringβ112Updated last year
- [SIGIR'24] The official implementation code of MOELoRA.β168Updated 11 months ago
- A curated list of awesome works in Routing LLMs paradigm (π Welcome to submit your contributions to this code repository)β40Updated last month
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Tokenβ144Updated 11 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuningβ160Updated this week
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scaleβ251Updated 3 weeks ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"β126Updated 9 months ago
- Awesome-Large-Search-Models is a collection of papers and resources (Methods, Datasets and other resources) about search-oriented large rβ¦β110Updated last week
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.β42Updated last year
- Fantastic Data Engineering for Large Language Modelsβ89Updated 5 months ago
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey`β179Updated 6 months ago
- β53Updated 9 months ago
- [ACM Computing Surveys 2025] This repository collects awesome survey, resource, and paper for Lifelong Learning with Large Language Modelβ¦β133Updated 3 weeks ago
- β66Updated 4 months ago
- β66Updated 5 months ago
- β142Updated 11 months ago
- β142Updated 11 months ago
- A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"β73Updated this week
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memoryβ128Updated this week
- Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"β81Updated 9 months ago
- An Awesome List of Reinforcement Learning-based Large Language Agent Works. Collect directly from official code base.β154Updated this week
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other moβ¦β373Updated 9 months ago
- β133Updated 9 months ago
- β109Updated 7 months ago