Dataset pruning for ImageNet and LAION-2B.
☆80Jul 5, 2024Updated last year
Alternatives and similar repositories for Dataset-Pruning
Users that are interested in Dataset-Pruning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A collection of visual instruction tuning datasets.☆77Mar 14, 2024Updated 2 years ago
- [ICLR 2024] Real-Fake: Effective Training Data Synthesis Through Distribution Matching☆79Dec 9, 2023Updated 2 years ago
- Metrics for "Beyond neural scaling laws: beating power law scaling via data pruning " (NeurIPS 2022 Outstanding Paper Award)☆58Apr 24, 2023Updated 3 years ago
- SVIT: Scaling up Visual Instruction Tuning☆168Jun 20, 2024Updated last year
- ☆10Oct 20, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆17Apr 28, 2024Updated 2 years ago
- Code for CVPR 2024 Oral "Neural Lineage"☆17Jun 18, 2024Updated last year
- [NeurIPS2023] "Selectivity Drives Productivity: Efficient Dataset Pruning for Enhanced Transfer Learning" by Yihua Zhang*, Yimeng Zhang*,…☆14Oct 12, 2023Updated 2 years ago
- Official Code for Dataset Distillation using Neural Feature Regression (NeurIPS 2022)☆48Nov 12, 2022Updated 3 years ago
- ☆30Apr 12, 2024Updated 2 years ago
- You Only Condense Once: Two Rules for Pruning Condensed Datasets (NeurIPS 2023)☆16Nov 18, 2023Updated 2 years ago
- A family of lightweight multimodal models.☆1,054Nov 18, 2024Updated last year
- AI-GenBench: A New Ongoing Benchmark for AI-Generated Image Detection☆32Feb 2, 2026Updated 3 months ago
- ☆36Apr 13, 2026Updated 3 weeks ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [CVPR2024] Efficient Dataset Distillation via Minimax Diffusion☆103Mar 22, 2024Updated 2 years ago
- [CVPR 2024] On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation Paradigm☆83Feb 24, 2025Updated last year
- The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)☆40Mar 25, 2023Updated 3 years ago
- Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically s…☆153Oct 1, 2023Updated 2 years ago
- (CVPR 2024) "Unsegment Anything by Simulating Deformation"☆29May 27, 2024Updated last year
- ☆32Mar 24, 2023Updated 3 years ago
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.☆34Mar 11, 2025Updated last year
- (NeurIPS 2023 spotlight) Large-scale Dataset Distillation/Condensation, 50 IPC (Images Per Class) achieves the highest 60.8% on original …☆139Nov 15, 2024Updated last year
- PyTorch implementation of paper "Sparse Parameterization for Epitomic Dataset Distillation" in NeurIPS 2023.☆20Jun 28, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆42Sep 5, 2023Updated 2 years ago
- ☆15Apr 25, 2023Updated 3 years ago
- Code to conduct an embedding attack on LLMs☆32Jan 10, 2025Updated last year
- Lossless Training Speed Up by Unbiased Dynamic Data Pruning☆346Sep 24, 2024Updated last year
- ☆56Mar 19, 2025Updated last year
- ☆42Sep 21, 2023Updated 2 years ago
- Efficient Dataset Distillation by Representative Matching☆114Feb 28, 2024Updated 2 years ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆92Feb 14, 2025Updated last year
- ☆11Jan 2, 2026Updated 4 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- DreamGaussian with 2D-GS☆12Oct 10, 2024Updated last year
- Official PyTorch implementation of "Dataset Condensation via Efficient Synthetic-Data Parameterization" (ICML'22)☆116Oct 18, 2023Updated 2 years ago
- Awesome coreset/core-set/subset/sample selection works.☆182Jun 30, 2024Updated last year
- A curated list of awesome papers on dataset distillation and related applications.☆1,930Apr 28, 2026Updated last week
- Based on the WACV 2020 paper - Fine Grained Classification and Retrieval by Combining Visual and Locally Pooled Textual Features☆25Nov 15, 2021Updated 4 years ago
- Official implementation of "Private Set Generation with Discriminative Information" (NeurIPS 2022)☆18Aug 14, 2023Updated 2 years ago
- The official implementation of paper "Spanning Training Progress: Temporal Dual-Depth Scoring (TDDS) for Enhanced Dataset Pruning" (CVPR …☆21Aug 20, 2024Updated last year