NUS-HPC-AI-Lab/InfoBatch

Lossless Training Speed Up by Unbiased Dynamic Data Pruning

☆346

Alternatives and similar repositories for InfoBatch

Users who are interested in InfoBatch are comparing it to the libraries listed below.

Yanqing0327 / DREAM
Efficient Dataset Distillation by Representative Matching
☆114 · Feb 28, 2024 · Updated 2 years ago
NUS-HPC-AI-Lab / DATM
ICLR 2024, Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching
☆108 · May 23, 2024 · Updated 2 years ago
xyupeng / LC-Booster
☆24 · Oct 14, 2022 · Updated 3 years ago
NUS-HPC-AI-Lab / DD-Ranking
Data distillation benchmark
☆73 · Jun 13, 2025 · Updated last year
NUS-HPC-AI-Lab / Neural-Network-Diffusion
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard laten…
☆887 · Jan 3, 2025 · Updated last year
NUS-HPC-AI-Lab / Helen
The official implementation of "Helen: Optimizing CTR Prediction Models with Frequency-wise Hessian Eigenvalue Regularization"
☆16 · Mar 14, 2024 · Updated 2 years ago
kaiwang960112 / CAFE
This is a method of dataset condensation, and it has been accepted by CVPR-2022.
☆73 · Dec 12, 2023 · Updated 2 years ago
DataDistillation / DataDAM
[ICCV 2023] DataDAM: Efficient Dataset Distillation with Attention Matching
☆34 · Jun 20, 2024 · Updated 2 years ago
albertotamajo / imagenet1k-coarse-classes
This repository organizes the ImageNet1k dataset into 10 coarse classes, where each class consists of semantically similar image categorie…
☆22 · Dec 11, 2023 · Updated 2 years ago
AngusDujw / FTD-distillation
The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)
☆40 · Mar 25, 2023 · Updated 3 years ago
vimar-gu / MinimaxDiffusion
[CVPR2024] Efficient Dataset Distillation via Minimax Diffusion
☆103 · Mar 22, 2024 · Updated 2 years ago
NUS-HPC-AI-Lab / Recurrent-Parameter-Generation
The official implementation of Recurrent Diffusion for Large-Scale Parameter Generation.
☆81 · Sep 24, 2025 · Updated 9 months ago
VILA-Lab / SRe2L
(NeurIPS 2023 spotlight) Large-scale Dataset Distillation/Condensation, 50 IPC (Images Per Class) achieves the highest 60.8% on original …
☆141 · Nov 15, 2024 · Updated last year
rgeirhos / dataset-pruning-metrics
Metrics for "Beyond neural scaling laws: beating power law scaling via data pruning" (NeurIPS 2022 Outstanding Paper Award)
☆58 · Apr 24, 2023 · Updated 3 years ago
Yanqing0327 / MLLMs-Augmented
The official implementation of "MLLMs-Augmented Visual-Language Representation Learning"
☆31 · Mar 12, 2024 · Updated 2 years ago
Huage001 / DatasetFactorization
PyTorch implementation of paper "Dataset Distillation via Factorization" in NeurIPS 2022.
☆67 · Nov 28, 2022 · Updated 3 years ago
NUS-HPC-AI-Lab / PAD
Prioritize Alignment in Dataset Distillation
☆21 · Dec 3, 2024 · Updated last year
Guang000 / Awesome-Dataset-Distillation
A curated list of awesome papers on dataset distillation and related applications.
☆1,964 · Updated this week
NUS-HPC-AI-Lab / SpeeD
SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
☆188 · Jan 27, 2025 · Updated last year
yongchaoz / FRePo
Official Code for Dataset Distillation using Neural Feature Regression (NeurIPS 2022)
☆48 · Nov 12, 2022 · Updated 3 years ago
NUS-HPC-AI-Lab / InfoGrowth
Efficient and Online Dataset Growth Algorithm (with cleanness and diversity awareness) to deal with growing web data
☆20 · Aug 6, 2024 · Updated last year
snu-mllab / Efficient-Dataset-Condensation
Official PyTorch implementation of "Dataset Condensation via Efficient Synthetic-Data Parameterization" (ICML'22)
☆115 · Oct 18, 2023 · Updated 2 years ago
shaoshitong / G_VBSM_Dataset_Condensation
[CVPR2024 highlight] Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching (G-VBSM)
☆27 · Oct 9, 2024 · Updated last year
he-y / you-only-condense-once
You Only Condense Once: Two Rules for Pruning Condensed Datasets (NeurIPS 2023)
☆16 · Nov 18, 2023 · Updated 2 years ago
haizhongzheng / Coverage-centric-coreset-selection
☆42 · Sep 21, 2023 · Updated 2 years ago
hrtan / MoSo
[NeurIPS-2023] The PyTorch Implementation of MoSo. The algorithms are based on our paper: "Data Pruning via Moving-one-Sample-out". MoSo …
☆10 · May 21, 2026 · Updated 2 months ago
BAAI-DCAI / Dataset-Pruning
Dataset pruning for ImageNet and LAION-2B.
☆80 · Jul 5, 2024 · Updated 2 years ago
Tencent-Hunyuan / HY-WU
HY-WU (Part I): An Extensible Functional Neural Memory Framework and An Instantiation in Text-Guided Image Editing
☆295 · Mar 18, 2026 · Updated 4 months ago
xyupeng / ContrastiveCrop
[CVPR 2022 Oral] Crafting Better Contrastive Views for Siamese Representation Learning
☆289 · Jun 27, 2022 · Updated 4 years ago
tmllab / 2023_ICLR_Moderate-DS
☆33 · Mar 24, 2023 · Updated 3 years ago
ichbill / LTDD
Official Implementation of paper "Distilling Long-tailed Datasets" [CVPR 2025]
☆24 · Aug 13, 2025 · Updated 11 months ago
Jiachen-T-Wang / GREATS
☆20 · Jun 27, 2026 · Updated 3 weeks ago
yangluo7 / CAME
[ACL 2023] The official implementation of "CAME: Confidence-guided Adaptive Memory Optimization"
☆100 · Mar 22, 2025 · Updated last year
LINs-lab / RDED
[CVPR 2024] On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation Paradigm
☆85 · Feb 24, 2025 · Updated last year
vimar-gu / SSD
[AAAI2024] Summarizing Stream Data for Memory-Restricted Online Continual Learning
☆22 · Apr 30, 2024 · Updated 2 years ago
MIV-XJTU / SPEED
PyTorch implementation of paper "Sparse Parameterization for Epitomic Dataset Distillation" in NeurIPS 2023.
☆20 · Jun 28, 2024 · Updated 2 years ago
VincenDen / IID
Exploiting Inter-sample and Inter-feature Relations in Dataset Distillation (CVPR24)
☆10 · Jun 16, 2024 · Updated 2 years ago
yangluo7 / V-ReasonBench
☆36 · Feb 18, 2026 · Updated 5 months ago
PatrickZH / Awesome-Coreset-Selection
Awesome coreset/core-set/subset/sample selection works.
☆184 · Jun 30, 2024 · Updated 2 years ago