fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing data operation costs, all with unmatched scalability.
☆1,834Feb 18, 2026Updated 3 weeks ago
Alternatives and similar repositories for fastdup
Users that are interested in fastdup are comparing it to the libraries listed below
Sorting:
- Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data …☆11,368Jan 13, 2026Updated 2 months ago
- Easily compute clip embeddings and build a clip retrieval system with them☆2,733Aug 15, 2025Updated 7 months ago
- Refine high-quality datasets and visual AI models☆10,448Updated this week
- Automatically find issues in image datasets and practice data-centric computer vision.☆1,158Jan 8, 2026Updated 2 months ago
- A python library for self-supervised learning on images.☆3,699Mar 4, 2026Updated last week
- Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.☆4,380Oct 19, 2025Updated 4 months ago
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML va…☆3,990Dec 28, 2025Updated 2 months ago
- Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.☆5,015Feb 24, 2026Updated 3 weeks ago
- Framework agnostic sliced/tiled inference + interactive ui + error analysis plots☆5,157Updated this week
- An open source implementation of CLIP.☆13,496Updated this week
- FFCV: Fast Forward Computer Vision (and other ML workloads!)☆2,986Jun 16, 2024Updated last year
- An Agnostic Computer Vision Framework - Pluggable to any Training Library: Fastai, Pytorch-Lightning with more to come☆870Nov 26, 2024Updated last year
- Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".☆1,997Mar 21, 2024Updated last year
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.☆3,294Mar 3, 2024Updated 2 years ago
- Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125☆15,275Jun 25, 2025Updated 8 months ago
- 🐍 Geometric Computer Vision Library for Spatial AI☆11,114Mar 9, 2026Updated last week
- The WeightWatcher tool for predicting the accuracy of Deep Neural Networks☆1,730Feb 26, 2026Updated 2 weeks ago
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…☆36,504Updated this week
- Sparsity-aware deep learning inference runtime for CPUs☆3,163Jun 2, 2025Updated 9 months ago
- PyTorch code and models for the DINOv2 self-supervised learning method.☆12,505Updated this week
- Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.☆6,033Updated this week
- A data augmentations library for audio, image, text, and video.☆5,070Mar 10, 2026Updated last week
- Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.☆3,375May 19, 2025Updated 9 months ago
- Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and …☆17,450Sep 5, 2024Updated last year
- Segment Anything in High Quality [NeurIPS 2023]☆4,186Sep 12, 2025Updated 6 months ago
- Images to inference with no labeling (use foundation models to train supervised models).☆2,644May 14, 2025Updated 10 months ago
- 🦘 Explore multimedia datasets at scale☆1,063Dec 7, 2024Updated last year
- LAVIS - A One-stop Library for Language-Vision Intelligence☆11,183Nov 18, 2024Updated last year
- Automatically create Faiss knn indices with the most optimal similarity search parameters.☆898Nov 4, 2025Updated 4 months ago
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…☆53,627Sep 18, 2024Updated last year
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…☆4,708Mar 9, 2026Updated last week
- Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stre…☆9,033Feb 16, 2026Updated last month
- Code release for "Cut and Learn for Unsupervised Object Detection and Instance Segmentation" and "VideoCutLER: Surprisingly Simple Unsupe…☆1,061Jun 4, 2025Updated 9 months ago
- Grounded Language-Image Pre-training☆2,576Jan 24, 2024Updated 2 years ago
- Lightweight Python library for adding real-time multi-object tracking to any detector.☆2,620Apr 30, 2025Updated 10 months ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆22,046Jan 23, 2026Updated last month
- Your PyTorch AI Factory - Flash enables you to easily configure and run complex AI recipes for over 15 tasks across 7 data domains☆1,731Oct 8, 2023Updated 2 years ago
- Containers for machine learning☆9,268Updated this week
- COYO-700M: Large-scale Image-Text Pair Dataset☆1,251Nov 30, 2022Updated 3 years ago