gszfwsb / Awesome-Dataset-ReductionLinks
A curated list of awesome papers on dataset reduction, including dataset distillation (dataset condensation) and dataset pruning (coreset selection).
☆58Updated 7 months ago
Alternatives and similar repositories for Awesome-Dataset-Reduction
Users that are interested in Awesome-Dataset-Reduction are comparing it to the libraries listed below
Sorting:
- Code for our ICML'24 on multimodal dataset distillation☆38Updated 10 months ago
- ☆49Updated 9 months ago
- [CVPR2024 highlight] Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching (G-VBSM)☆28Updated 10 months ago
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆71Updated 2 months ago
- Provide .bst files for NeurIPS latex template☆49Updated 4 months ago
- A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg…☆119Updated 3 weeks ago
- ☆136Updated 6 months ago
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆220Updated 8 months ago
- Survey on Data-centric Large Language Models☆84Updated last year
- [CVPR 2024] On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation Paradigm☆73Updated 6 months ago
- Prioritize Alignment in Dataset Distillation☆20Updated 8 months ago
- [ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality☆37Updated last month
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆98Updated 8 months ago
- Paper List of Inference/Test Time Scaling/Computing☆294Updated last month
- A library of visualization tools for the interpretability and hallucination analysis of large vision-language models (LVLMs).☆37Updated 3 months ago
- [CVPR2024] Efficient Dataset Distillation via Minimax Diffusion☆95Updated last year
- ICLR 2024, Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching☆101Updated last year
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆131Updated last month
- Monitor Google Scholar author citation counts and track changes automatically without opening tabs.☆66Updated 2 weeks ago
- ☆29Updated 2 years ago
- 抢占显卡☆77Updated 10 months ago
- Official implementation of ECCV 2024 paper: Take A Step Back: Rethinking the Two Stages in Visual Reasoning☆14Updated 2 months ago
- Official repo of M$^2$PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning☆25Updated 5 months ago
- IDEAL: Influence-Driven Selective Annotations Empower In-Context Learners in Large Language Models☆59Updated last year
- Survey: https://arxiv.org/pdf/2507.20198☆107Updated last week
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆78Updated 6 months ago
- Code for DeCo: Decoupling token compression from semanchc abstraction in multimodal large language models☆67Updated last month
- The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?☆31Updated 9 months ago
- (NeurIPS 2023 spotlight) Large-scale Dataset Distillation/Condensation, 50 IPC (Images Per Class) achieves the highest 60.8% on original …☆129Updated 9 months ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆66Updated 5 months ago