Reduce end to end training time from days to hours (or hours to minutes), and energy requirements/costs by an order of magnitude using coresets and data selection.
☆349May 24, 2023Updated 3 years ago
Alternatives and similar repositories for cords
Users that are interested in cords are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Efficiently search and mine for specific (targeted) classes/slices in your dataset to improve model performance and personalize your mode…☆20Nov 17, 2023Updated 2 years ago
- Data-efficient Training of Machine Learning Models☆72Dec 7, 2020Updated 5 years ago
- Code for coreset selection methods☆255Feb 27, 2023Updated 3 years ago
- DISTIL: Deep dIverSified inTeractIve Learning. An active/inter-active learning library built on py-torch for reducing labeling costs.☆156Feb 5, 2023Updated 3 years ago
- Summarize Massive Datasets using Submodular Optimization☆129May 14, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Coresets via Bilevel Optimization☆68Nov 2, 2020Updated 5 years ago
- SPEAR: Programmatically label and build training data quickly.☆111Jun 27, 2024Updated last year
- ☆97Jan 14, 2021Updated 5 years ago
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data☆13Jul 21, 2024Updated last year
- [NeurIPS 2023] Towards Free Data Selection with General-Purpose Models☆42Mar 14, 2025Updated last year
- ☆112Jun 20, 2023Updated 2 years ago
- BackPACK - a backpropagation package built on top of PyTorch which efficiently computes quantities other than the gradient.☆615Nov 28, 2025Updated 6 months ago
- Material for my course: Optimization in Machine Learning☆32Mar 9, 2021Updated 5 years ago
- apricot implements submodular optimization for the purpose of selecting subsets of massive data sets to train machine learning models qui…☆531Nov 17, 2025Updated 6 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Pytorch dataloader and pytorch lightning datamodule for Earth Observation imagery☆15Apr 7, 2022Updated 4 years ago
- [ICML 2021] "Efficient Lottery Ticket Finding: Less Data is More" by Zhenyu Zhang*, Xuxi Chen*, Tianlong Chen*, Zhangyang Wang☆26Dec 30, 2021Updated 4 years ago
- Automated Scalable Bayesian Inference☆132Jan 16, 2022Updated 4 years ago
- Code for paper: “What Data Benefits My Classifier?” Enhancing Model Performance and Interpretability through Influence-Based Data Selecti…☆23May 17, 2024Updated 2 years ago
- This repository consists of useful tools or guides for system software development or anything interesting.☆11Feb 27, 2026Updated 3 months ago
- ☆180Jul 25, 2024Updated last year
- Guarantees on the behavior of neural networks don't always have to come at the cost of performance.☆30Oct 12, 2022Updated 3 years ago
- Coresets☆38Apr 24, 2022Updated 4 years ago
- Collection of simple functions reusable across ML projects.☆20Jul 6, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Bayesian active learning library for research and industrial usecases.☆931Dec 3, 2025Updated 6 months ago
- PyTorch implementation of consistency regularization methods for semi-supervised learning☆81Aug 20, 2020Updated 5 years ago
- Codebase for "Online Fast Adaptation and Knowledge Accumulation: a New Approach to Continual Learning". This is a ServiceNow Research pro…☆106Jun 27, 2022Updated 3 years ago
- Awesome Active Learning Paper List☆149Apr 23, 2024Updated 2 years ago
- Official PyTorch implementation of "EvoGrad: Efficient Gradient-Based Meta-Learning and Hyperparameter Optimization"☆23Oct 24, 2021Updated 4 years ago
- A curated list of awesome Active Learning☆802Mar 26, 2026Updated 2 months ago
- Dataset Condensation (ICLR21 and ICML21)☆542Nov 27, 2023Updated 2 years ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- A Survey of Dataset Refinement for Problems in Computer Vision Datasets☆34Sep 12, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- CS 7301: Spring 2021 Course on Advanced Topics in Optimization in Machine Learning☆182Apr 16, 2021Updated 5 years ago
- Source code for ICLR 2018 Paper: Active Learning for Convolutional Neural Networks: A Core-Set Approach☆279Oct 23, 2018Updated 7 years ago
- Fast and memory-efficient clustering + coreset construction, including fast distance kernels for Bregman and f-divergences.☆34Sep 6, 2023Updated 2 years ago
- A curated list of awesome papers on dataset distillation and related applications.☆1,944Updated this week
- An automated feature engineering framework 'FETCH' accepted in ICLR 2023.☆11Jun 20, 2023Updated 2 years ago
- Avalanche: an End-to-End Library for Continual Learning based on PyTorch.☆2,055Mar 11, 2025Updated last year
- Open-source code for paper "Dataset Distillation"☆824Jun 17, 2025Updated 11 months ago