dada-qin / Data-Centric_LLM_Studies
A list of papers about data quality in Large Language Models (LLMs)
★23 · Updated last year
Alternatives and similar repositories for Data-Centric_LLM_Studies:
Users who are interested in Data-Centric_LLM_Studies are comparing it to the repositories listed below:
- Survey on Data-centric Large Language Models ★72 · Updated 6 months ago
- ★82 · Updated this week
- [EMNLP 2024 Findings🔥] Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Infe… ★88 · Updated 2 months ago
- [SIGIR'24] The official implementation code of MOELoRA. ★142 · Updated 5 months ago
- ★78 · Updated last year
- ★84 · Updated 4 months ago
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment ★270 · Updated 8 months ago
- ★121 · Updated 5 months ago
- A Self-Training Framework for Vision-Language Reasoning ★60 · Updated 2 months ago
- This repository has transferred to https://github.com/TUDB-Labs/MoE-PEFT ★23 · Updated 5 months ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models ★53 · Updated 5 months ago
- A curated list of awesome Multimodal studies. ★122 · Updated this week
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning ★160 · Updated 11 months ago
- Paper list and datasets for the paper: A Survey on Data Selection for LLM Instruction Tuning ★38 · Updated 11 months ago
- ★57 · Updated 7 months ago
- Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models ★235 · Updated last week
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi… ★96 · Updated 3 months ago
- ★49 · Updated 2 months ago
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method ★119 · Updated 4 months ago
- The code and data of DPA-RAG ★54 · Updated 3 months ago
- ★36 · Updated 4 months ago
- ★159 · Updated 6 months ago
- The demo, code and data of FollowRAG ★68 · Updated last month
- Paper collections of multi-modal LLM for Math/STEM/Code. ★54 · Updated last week
- ★11 · Updated last month
- ★26 · Updated 2 months ago
- [ICML'2024] Can AI Assistants Know What They Don't Know? ★76 · Updated 11 months ago
- ★44 · Updated 3 months ago
- ★14 · Updated last month
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning. ★120 · Updated last week