BAAI-DCAI / Dataset-PruningView external linksLinks
Dataset pruning for ImageNet and LAION-2B.
☆79Jul 5, 2024Updated last year
Alternatives and similar repositories for Dataset-Pruning
Users that are interested in Dataset-Pruning are comparing it to the libraries listed below
Sorting:
- A collection of visual instruction tuning datasets.☆76Mar 14, 2024Updated last year
- Metrics for "Beyond neural scaling laws: beating power law scaling via data pruning " (NeurIPS 2022 Outstanding Paper Award)☆57Apr 24, 2023Updated 2 years ago
- [ICLR 2024] Real-Fake: Effective Training Data Synthesis Through Distribution Matching☆78Dec 9, 2023Updated 2 years ago
- You Only Condense Once: Two Rules for Pruning Condensed Datasets (NeurIPS 2023)☆15Nov 18, 2023Updated 2 years ago
- SVIT: Scaling up Visual Instruction Tuning☆166Jun 20, 2024Updated last year
- [NeurIPS2023] "Selectivity Drives Productivity: Efficient Dataset Pruning for Enhanced Transfer Learning" by Yihua Zhang*, Yimeng Zhang*,…☆14Oct 12, 2023Updated 2 years ago
- Code for CVPR 2024 Oral "Neural Lineage"☆17Jun 18, 2024Updated last year
- A family of lightweight multimodal models.☆1,051Nov 18, 2024Updated last year
- Data distillation benchmark☆72Jun 13, 2025Updated 8 months ago
- PyTorch implementation of paper "Sparse Parameterization for Epitomic Dataset Distillation" in NeurIPS 2023.☆20Jun 28, 2024Updated last year
- [CVPR 2024] On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation Paradigm☆81Feb 24, 2025Updated 11 months ago
- ☆41Sep 21, 2023Updated 2 years ago
- ☆91Jan 22, 2023Updated 3 years ago
- ☆42Sep 5, 2023Updated 2 years ago
- The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)☆40Mar 25, 2023Updated 2 years ago
- (NeurIPS 2023 spotlight) Large-scale Dataset Distillation/Condensation, 50 IPC (Images Per Class) achieves the highest 60.8% on original …☆136Nov 15, 2024Updated last year
- (CVPR 2024) "Unsegment Anything by Simulating Deformation"☆29May 27, 2024Updated last year
- [CVPR2024] Efficient Dataset Distillation via Minimax Diffusion☆104Mar 22, 2024Updated last year
- 蚂蚁金融自然语言处理竞赛。☆10Sep 3, 2018Updated 7 years ago
- ☆10Oct 20, 2023Updated 2 years ago
- Efficient Dataset Distillation by Representative Matching☆113Feb 28, 2024Updated last year
- Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically s…☆151Oct 1, 2023Updated 2 years ago
- ☆14Apr 25, 2023Updated 2 years ago
- This repository provides a multi task benchmark for instance segmentation, depth estimation, and 3D object detection.☆14Jul 29, 2023Updated 2 years ago
- STVQA and TextVQA OCR results from Amazon Text in Image pipeline☆11Jul 18, 2022Updated 3 years ago
- Based on the WACV 2020 paper - Fine Grained Classification and Retrieval by Combining Visual and Locally Pooled Textual Features☆25Nov 15, 2021Updated 4 years ago
- ☆30Nov 5, 2024Updated last year
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆91Feb 14, 2025Updated last year
- Official PyTorch implementation of "Dataset Condensation via Efficient Synthetic-Data Parameterization" (ICML'22)☆116Oct 18, 2023Updated 2 years ago
- Official PyTorch Implementation for the "Distilling Datasets Into Less Than One Image" paper.☆39Jun 6, 2024Updated last year
- Lossless Training Speed Up by Unbiased Dynamic Data Pruning☆343Sep 24, 2024Updated last year
- [ECCV2024] Vista3D: Unravel the 3D Darkside of a Single Image☆55Sep 19, 2024Updated last year
- Official implementation of Dancing with Still Images: Video Distillation via Static-Dynamic Disentanglement.☆30Dec 21, 2025Updated last month
- This repository compiles a list of papers/resources related to the graph retrieval-augmented generation! Star⭐ the repo and follow me if …☆11Dec 7, 2024Updated last year
- ☆19May 13, 2022Updated 3 years ago
- [TNNLS] Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases☆16Jul 10, 2025Updated 7 months ago
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆16Jun 20, 2023Updated 2 years ago
- Awesome coreset/core-set/subset/sample selection works.☆182Jun 30, 2024Updated last year
- LowFER: Low-rank Bilinear Pooling for Link Prediction (ICML 2020)☆13Sep 24, 2022Updated 3 years ago