dada-qin / Data-Centric_LLM_Studies
A list of papers about data quality in Large Language Models (LLMs)
☆20Updated 10 months ago
Related projects ⓘ
Alternatives and complementary repositories for Data-Centric_LLM_Studies
- Survey on Data-centric Large Language Models☆63Updated 4 months ago
- [SIGIR'24] The official implementation code of MOELoRA.☆123Updated 3 months ago
- ☆115Updated 3 months ago
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment☆223Updated 6 months ago
- ☆60Updated 2 months ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆152Updated 9 months ago
- ☆70Updated 10 months ago
- ☆24Updated 8 months ago
- ☆149Updated 2 weeks ago
- ☆37Updated 5 months ago
- UniGen: A Unified Framework for Dataset Generation via Large Language Model☆28Updated last month
- The related works and background techniques about Openai o1☆137Updated this week
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆81Updated last month
- Paper collections of retrieval-based (augmented) language model.☆229Updated 5 months ago
- Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models☆133Updated this week
- Paper List for In-context Learning 🌷☆169Updated 8 months ago
- A paper list about diffusion models for natural language processing.☆175Updated last year
- ☆141Updated 3 weeks ago
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey`☆144Updated 3 weeks ago
- A curated list of awesome Multimodal studies.☆92Updated last week
- Paper list and datasets for the paper: A Survey on Data Selection for LLM Instruction Tuning☆32Updated 9 months ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models☆53Updated 3 months ago
- 主要记录大语言大模型(LLMs) 算法(应用)工程师多模态相关知识☆80Updated 5 months ago
- Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models☆53Updated 4 months ago
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.☆99Updated last month
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method☆89Updated 2 months ago
- ☆34Updated 2 months ago
- This repository has transferred to https://github.com/TUDB-Labs/MoE-PEFT☆24Updated 2 months ago
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings☆142Updated 4 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆96Updated last week