Dataset pruning for ImageNet and LAION-2B.
☆81 · Jul 5, 2024 · Updated last year
Alternatives and similar repositories for Dataset-Pruning
Users who are interested in Dataset-Pruning are comparing it to the repositories listed below.
- A collection of visual instruction tuning datasets. ☆77 · Mar 14, 2024 · Updated 2 years ago
- [ICLR 2024] Real-Fake: Effective Training Data Synthesis Through Distribution Matching ☆79 · Dec 9, 2023 · Updated 2 years ago
- SVIT: Scaling up Visual Instruction Tuning ☆168 · Jun 20, 2024 · Updated last year
- ☆17 · Apr 28, 2024 · Updated 2 years ago
- Code for CVPR 2024 Oral "Neural Lineage" ☆17 · Jun 18, 2024 · Updated last year
- Official Code for Dataset Distillation using Neural Feature Regression (NeurIPS 2022) ☆48 · Nov 12, 2022 · Updated 3 years ago
- ☆30 · Apr 12, 2024 · Updated 2 years ago
- You Only Condense Once: Two Rules for Pruning Condensed Datasets (NeurIPS 2023) ☆16 · Nov 18, 2023 · Updated 2 years ago
- A family of lightweight multimodal models. ☆1,053 · Nov 18, 2024 · Updated last year
- AI-GenBench: A New Ongoing Benchmark for AI-Generated Image Detection ☆35 · Feb 2, 2026 · Updated 3 months ago
- ☆36 · Apr 13, 2026 · Updated last month
- [CVPR 2024] Efficient Dataset Distillation via Minimax Diffusion ☆103 · Mar 22, 2024 · Updated 2 years ago
- [CVPR 2024] On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation Paradigm ☆83 · Feb 24, 2025 · Updated last year
- The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR 2023) ☆40 · Mar 25, 2023 · Updated 3 years ago
- Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically s… ☆153 · Oct 1, 2023 · Updated 2 years ago
- (CVPR 2024) "Unsegment Anything by Simulating Deformation" ☆29 · May 27, 2024 · Updated 2 years ago
- ☆32 · Mar 24, 2023 · Updated 3 years ago
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2. ☆34 · Mar 11, 2025 · Updated last year
- (NeurIPS 2023 spotlight) Large-scale Dataset Distillation/Condensation, 50 IPC (Images Per Class) achieves the highest 60.8% on original … ☆142 · Nov 15, 2024 · Updated last year
- PyTorch implementation of paper "Sparse Parameterization for Epitomic Dataset Distillation" in NeurIPS 2023. ☆20 · Jun 28, 2024 · Updated last year
- Code to conduct an embedding attack on LLMs ☆32 · Jan 10, 2025 · Updated last year
- Lossless Training Speed Up by Unbiased Dynamic Data Pruning ☆346 · Sep 24, 2024 · Updated last year
- ☆108 · Feb 20, 2024 · Updated 2 years ago
- ☆56 · Mar 19, 2025 · Updated last year
- Efficient Dataset Distillation by Representative Matching ☆114 · Feb 28, 2024 · Updated 2 years ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆92 · Feb 14, 2025 · Updated last year
- ☆15 · Apr 25, 2023 · Updated 3 years ago
- DreamGaussian with 2D-GS ☆12 · Oct 10, 2024 · Updated last year
- Official PyTorch implementation of "Dataset Condensation via Efficient Synthetic-Data Parameterization" (ICML'22) ☆116 · Oct 18, 2023 · Updated 2 years ago
- Awesome coreset/core-set/subset/sample selection works. ☆183 · Jun 30, 2024 · Updated last year
- [ICCV 2023] Dataset Quantization ☆263 · Jan 6, 2024 · Updated 2 years ago
- A curated list of awesome papers on dataset distillation and related applications. ☆1,933 · May 19, 2026 · Updated last week
- ☆13 · Dec 13, 2024 · Updated last year
- Code for ECCV 2022 paper “Learning with Recoverable Forgetting” ☆21 · Jul 27, 2022 · Updated 3 years ago
- Official implementation of "Private Set Generation with Discriminative Information" (NeurIPS 2022) ☆18 · Aug 14, 2023 · Updated 2 years ago
- The official implementation of paper "Spanning Training Progress: Temporal Dual-Depth Scoring (TDDS) for Enhanced Dataset Pruning" (CVPR … ☆21 · Aug 20, 2024 · Updated last year
- ☆31 · Dec 20, 2022 · Updated 3 years ago
- A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo ☆35 · Aug 12, 2024 · Updated last year
- Implementation of "DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation" (accepted to NAACL 2024 Findings). ☆28 · Feb 10, 2025 · Updated last year