gszfwsb / Awesome-Dataset-ReductionLinks
A curated list of awesome papers on dataset reduction, including dataset distillation (dataset condensation) and dataset pruning (coreset selection).
☆58Updated 10 months ago
Alternatives and similar repositories for Awesome-Dataset-Reduction
Users that are interested in Awesome-Dataset-Reduction are comparing it to the libraries listed below
Sorting:
- ☆54Updated last year
- Provide .bst files for NeurIPS latex template☆49Updated 7 months ago
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆91Updated 2 months ago
- [CVPR2024 highlight] Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching (G-VBSM)☆28Updated last year
- Code for our ICML'24 on multimodal dataset distillation☆42Updated last year
- [EMNLP 2025 main 🔥] Code for "Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More"☆91Updated last month
- 关于LLM和Multimodal LLM的paper list☆50Updated this week
- Latest Advances on Modality Priors in Multimodal Large Language Models☆28Updated 2 months ago
- Survey: https://arxiv.org/pdf/2507.20198☆228Updated last month
- One-shot Entropy Minimization☆187Updated 5 months ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆24Updated 9 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆213Updated last month
- Paper List of Inference/Test Time Scaling/Computing☆326Updated 3 months ago
- ☆151Updated 9 months ago
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆234Updated last year
- A paper list of Awesome Latent Space.☆123Updated this week
- ☆30Updated 2 years ago
- Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs"☆71Updated 2 months ago
- A library of visualization tools for the interpretability and hallucination analysis of large vision-language models (LVLMs).☆42Updated 6 months ago
- [ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality☆55Updated 5 months ago
- Code release for VTW (AAAI 2025 Oral)☆65Updated last month
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆122Updated 2 months ago
- CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for task-aware parameter-efficient fine-tuning(NeurIPS 2024)☆53Updated 10 months ago
- IDEAL: Influence-Driven Selective Annotations Empower In-Context Learners in Large Language Models☆59Updated last year
- ICLR 2024, Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching☆104Updated last year
- Data distillation benchmark☆71Updated 5 months ago
- Prioritize Alignment in Dataset Distillation☆20Updated last year
- Official repo of M$^2$PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning☆27Updated 8 months ago
- Awesome Low-Rank Adaptation☆55Updated 4 months ago
- ☆185Updated 6 months ago