MLGroup-JLU / LLM-data-aug-survey
The official GitHub page for the survey paper "A Survey on Data Augmentation in Large Model Era"
☆116Updated 6 months ago
Alternatives and similar repositories for LLM-data-aug-survey:
Users that are interested in LLM-data-aug-survey are comparing it to the libraries listed below
- ☆120Updated 11 months ago
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆41Updated 7 months ago
- ☆172Updated 3 weeks ago
- ☆96Updated 8 months ago
- Real-time updated, fine-grained reading list on LLM-synthetic-data.🔥☆201Updated this week
- This repo aims to record resource of role-playing abilities in LLMs, including dataset, paper, application, etc.☆90Updated 4 months ago
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"…☆74Updated last year
- [SIGIR'24] The official implementation code of MOELoRA.☆143Updated 6 months ago
- This repository collects awesome survey, resource, and paper for Lifelong Learning with Large Language Models. (Updated Regularly)☆84Updated last week
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆107Updated 2 months ago
- A Toolkit for Table-based Question Answering☆109Updated last year
- ☆47Updated this week
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆100Updated 4 months ago
- ☆52Updated 4 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆138Updated 4 months ago
- The code and data of DPA-RAG☆55Updated last week
- Paper list and datasets for the paper: A Survey on Data Selection for LLM Instruction Tuning☆38Updated 11 months ago
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆209Updated 3 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆333Updated 4 months ago
- Survey of Small Language Models from Penn State, ...☆143Updated 2 weeks ago
- Fantastic Data Engineering for Large Language Models☆67Updated last month
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs☆93Updated last month
- ☆78Updated last year
- Awesome papers for role-playing with language models☆154Updated 2 months ago
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey`☆156Updated last month
- 顾名思义:手搓的RAG☆116Updated 11 months ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆237Updated last year
- ☆96Updated 2 months ago
- ☆128Updated 9 months ago
- [ICML 2024] Selecting High-Quality Data for Training Language Models☆156Updated 7 months ago