Reduce end to end training time from days to hours (or hours to minutes), and energy requirements/costs by an order of magnitude using coresets and data selection.
☆346May 24, 2023Updated 2 years ago
Alternatives and similar repositories for cords
Users that are interested in cords are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data-efficient Training of Machine Learning Models☆71Dec 7, 2020Updated 5 years ago
- DISTIL: Deep dIverSified inTeractIve Learning. An active/inter-active learning library built on py-torch for reducing labeling costs.☆155Feb 5, 2023Updated 3 years ago
- Awesome coreset/core-set/subset/sample selection works.☆181Jun 30, 2024Updated last year
- Coresets via Bilevel Optimization☆68Nov 2, 2020Updated 5 years ago
- SPEAR: Programmatically label and build training data quickly.☆110Jun 27, 2024Updated last year
- ☆42Sep 21, 2023Updated 2 years ago
- ☆97Jan 14, 2021Updated 5 years ago
- Code for "Hitting the Target: Stopping Active Learning at the Cost-Based Optimum"☆13Aug 5, 2022Updated 3 years ago
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data☆14Jul 21, 2024Updated last year
- [NeurIPS 2023] Towards Free Data Selection with General-Purpose Models☆41Mar 14, 2025Updated last year
- ☆113Jun 20, 2023Updated 2 years ago
- BackPACK - a backpropagation package built on top of PyTorch which efficiently computes quantities other than the gradient.☆606Nov 28, 2025Updated 3 months ago
- Material for my course: Optimization in Machine Learning☆32Mar 9, 2021Updated 5 years ago
- apricot implements submodular optimization for the purpose of selecting subsets of massive data sets to train machine learning models qui…☆528Nov 17, 2025Updated 4 months ago
- ☆12Mar 3, 2022Updated 4 years ago
- Official repository for FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models☆34Sep 19, 2025Updated 6 months ago
- Pytorch dataloader and pytorch lightning datamodule for Earth Observation imagery☆15Apr 7, 2022Updated 3 years ago
- Automated Scalable Bayesian Inference☆132Jan 16, 2022Updated 4 years ago
- ☆179Jul 25, 2024Updated last year
- Guarantees on the behavior of neural networks don't always have to come at the cost of performance.☆30Oct 12, 2022Updated 3 years ago
- Bayesian active learning library for research and industrial usecases.☆922Dec 3, 2025Updated 3 months ago
- Coresets☆38Apr 24, 2022Updated 3 years ago
- Collection of simple functions reusable across ML projects.☆20Jul 6, 2021Updated 4 years ago
- PyTorch implementation of consistency regularization methods for semi-supervised learning☆80Aug 20, 2020Updated 5 years ago
- Codebase for "Online Fast Adaptation and Knowledge Accumulation: a New Approach to Continual Learning". This is a ServiceNow Research pro…☆106Jun 27, 2022Updated 3 years ago
- SelectiveBackprop accelerates training by dynamically prioritizing useful examples with high loss☆32Mar 12, 2020Updated 6 years ago
- Awesome Active Learning Paper List☆148Apr 23, 2024Updated last year
- Official PyTorch implementation of "EvoGrad: Efficient Gradient-Based Meta-Learning and Hyperparameter Optimization"☆23Oct 24, 2021Updated 4 years ago
- A curated list of awesome Active Learning☆796Oct 20, 2024Updated last year
- [NeurIPS 2020] Coresets for Robust Training of Neural Networks against Noisy Labels☆36May 2, 2021Updated 4 years ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- Torch Distributed Experimental☆117Aug 5, 2024Updated last year
- A Survey of Dataset Refinement for Problems in Computer Vision Datasets☆34Sep 12, 2025Updated 6 months ago
- ☆11Jun 2, 2021Updated 4 years ago
- Source code for ICLR 2018 Paper: Active Learning for Convolutional Neural Networks: A Core-Set Approach☆279Oct 23, 2018Updated 7 years ago
- A curated list of awesome papers on dataset distillation and related applications.☆1,913Updated this week
- Avalanche: an End-to-End Library for Continual Learning based on PyTorch.☆2,041Mar 11, 2025Updated last year
- Open-source code for paper "Dataset Distillation"☆824Jun 17, 2025Updated 9 months ago
- FFCV: Fast Forward Computer Vision (and other ML workloads!)☆2,986Jun 16, 2024Updated last year