Dataset pruning for ImageNet and LAION-2B.
☆79Jul 5, 2024Updated last year
Alternatives and similar repositories for Dataset-Pruning
Users that are interested in Dataset-Pruning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A collection of visual instruction tuning datasets.☆77Mar 14, 2024Updated 2 years ago
- [ICLR 2024] Real-Fake: Effective Training Data Synthesis Through Distribution Matching☆79Dec 9, 2023Updated 2 years ago
- Metrics for "Beyond neural scaling laws: beating power law scaling via data pruning " (NeurIPS 2022 Outstanding Paper Award)☆58Apr 24, 2023Updated 2 years ago
- SVIT: Scaling up Visual Instruction Tuning☆167Jun 20, 2024Updated last year
- ☆10Oct 20, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Data distillation benchmark☆72Jun 13, 2025Updated 10 months ago
- Code for CVPR 2024 Oral "Neural Lineage"☆17Jun 18, 2024Updated last year
- Official Code for Dataset Distillation using Neural Feature Regression (NeurIPS 2022)☆48Nov 12, 2022Updated 3 years ago
- A family of lightweight multimodal models.☆1,053Nov 18, 2024Updated last year
- [CVPR2024] Efficient Dataset Distillation via Minimax Diffusion☆103Mar 22, 2024Updated 2 years ago
- [CVPR 2024] On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation Paradigm☆83Feb 24, 2025Updated last year
- The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)☆40Mar 25, 2023Updated 3 years ago
- (CVPR 2024) "Unsegment Anything by Simulating Deformation"☆29May 27, 2024Updated last year
- STVQA and TextVQA OCR results from Amazon Text in Image pipeline☆12Jul 18, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆32Mar 24, 2023Updated 3 years ago
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.☆34Mar 11, 2025Updated last year
- (NeurIPS 2023 spotlight) Large-scale Dataset Distillation/Condensation, 50 IPC (Images Per Class) achieves the highest 60.8% on original …☆139Nov 15, 2024Updated last year
- PyTorch implementation of paper "Sparse Parameterization for Epitomic Dataset Distillation" in NeurIPS 2023.☆20Jun 28, 2024Updated last year
- ☆42Sep 5, 2023Updated 2 years ago
- Vico: Compositional Video Generation as Flow Equalization☆59Nov 15, 2024Updated last year
- Lossless Training Speed Up by Unbiased Dynamic Data Pruning☆347Sep 24, 2024Updated last year
- [ECCV2024] Vista3D: Unravel the 3D Darkside of a Single Image☆57Sep 19, 2024Updated last year
- ☆55Mar 19, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Efficient Dataset Distillation by Representative Matching☆114Feb 28, 2024Updated 2 years ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆92Feb 14, 2025Updated last year
- ☆11Jan 2, 2026Updated 3 months ago
- DreamGaussian with 2D-GS☆12Oct 10, 2024Updated last year
- Official PyTorch Implementation for the "Distilling Datasets Into Less Than One Image" paper.☆39Jun 6, 2024Updated last year
- Awesome coreset/core-set/subset/sample selection works.☆182Jun 30, 2024Updated last year
- A curated list of awesome papers on dataset distillation and related applications.☆1,924Apr 8, 2026Updated last week
- [ICCV2023] Dataset Quantization☆263Jan 6, 2024Updated 2 years ago
- ☆13Dec 13, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official implementation of "Private Set Generation with Discriminative Information" (NeurIPS 2022)☆18Aug 14, 2023Updated 2 years ago
- The official implementation of paper "Spanning Training Progress: Temporal Dual-Depth Scoring (TDDS) for Enhanced Dataset Pruning" (CVPR …☆21Aug 20, 2024Updated last year
- Exploiting Inter-sample and Inter-feature Relations in Dataset Distillation (CVPR24)☆11Jun 16, 2024Updated last year
- ☆31Dec 20, 2022Updated 3 years ago
- A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo☆35Aug 12, 2024Updated last year
- Code for coreset selection methods☆253Feb 27, 2023Updated 3 years ago
- Model LEGO: Creating Models Like Disassembling and Assembling Building Blocks☆17Jan 15, 2025Updated last year