Dataset pruning for ImageNet and LAION-2B.
☆81Jul 5, 2024Updated last year
Alternatives and similar repositories for Dataset-Pruning
Users that are interested in Dataset-Pruning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A collection of visual instruction tuning datasets.☆77Mar 14, 2024Updated 2 years ago
- [ICLR 2024] Real-Fake: Effective Training Data Synthesis Through Distribution Matching☆79Dec 9, 2023Updated 2 years ago
- SVIT: Scaling up Visual Instruction Tuning☆168Jun 20, 2024Updated last year
- ☆10May 21, 2026Updated 3 weeks ago
- Data distillation benchmark☆72Jun 13, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for CVPR 2024 Oral "Neural Lineage"☆17Jun 18, 2024Updated 2 years ago
- [NeurIPS2023] "Selectivity Drives Productivity: Efficient Dataset Pruning for Enhanced Transfer Learning" by Yihua Zhang*, Yimeng Zhang*,…☆14Oct 12, 2023Updated 2 years ago
- ☆30Apr 12, 2024Updated 2 years ago
- You Only Condense Once: Two Rules for Pruning Condensed Datasets (NeurIPS 2023)☆16Nov 18, 2023Updated 2 years ago
- A family of lightweight multimodal models.☆1,054Nov 18, 2024Updated last year
- ☆91Jan 22, 2023Updated 3 years ago
- ☆36Apr 13, 2026Updated 2 months ago
- [CVPR2024] Efficient Dataset Distillation via Minimax Diffusion☆103Mar 22, 2024Updated 2 years ago
- [CVPR 2024] On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation Paradigm☆83Feb 24, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)☆40Mar 25, 2023Updated 3 years ago
- Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically s…☆153Oct 1, 2023Updated 2 years ago
- ☆33Mar 24, 2023Updated 3 years ago
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.☆34Mar 11, 2025Updated last year
- (NeurIPS 2023 spotlight) Large-scale Dataset Distillation/Condensation, 50 IPC (Images Per Class) achieves the highest 60.8% on original …☆142Nov 15, 2024Updated last year
- PyTorch implementation of paper "Sparse Parameterization for Epitomic Dataset Distillation" in NeurIPS 2023.☆20Jun 28, 2024Updated last year
- ☆42Sep 5, 2023Updated 2 years ago
- Lossless Training Speed Up by Unbiased Dynamic Data Pruning☆346Sep 24, 2024Updated last year
- [ECCV2024] Vista3D: Unravel the 3D Darkside of a Single Image☆57Sep 19, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆56Mar 19, 2025Updated last year
- ☆42Sep 21, 2023Updated 2 years ago
- Efficient Dataset Distillation by Representative Matching☆114Feb 28, 2024Updated 2 years ago
- ☆15Apr 25, 2023Updated 3 years ago
- DreamGaussian with 2D-GS☆12Oct 10, 2024Updated last year
- Official PyTorch Implementation for the "Distilling Datasets Into Less Than One Image" paper.☆39Jun 6, 2024Updated 2 years ago
- Awesome coreset/core-set/subset/sample selection works.☆184Jun 30, 2024Updated last year
- [ICCV2023] Dataset Quantization☆262Jan 6, 2024Updated 2 years ago
- A curated list of awesome papers on dataset distillation and related applications.☆1,944Jun 12, 2026Updated last week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆13Dec 13, 2024Updated last year
- Code for ECCV 2022 paper “Learning with Recoverable Forgetting”☆21Jul 27, 2022Updated 3 years ago
- ☆11Jan 2, 2026Updated 5 months ago
- Exploiting Inter-sample and Inter-feature Relations in Dataset Distillation (CVPR24)☆10Jun 16, 2024Updated 2 years ago
- ☆31Dec 20, 2022Updated 3 years ago
- A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo☆35Aug 12, 2024Updated last year
- Code for coreset selection methods☆256Feb 27, 2023Updated 3 years ago