fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing data operation costs, all with unmatched scalability.
☆1,833Feb 18, 2026Updated last week
Alternatives and similar repositories for fastdup
Users that are interested in fastdup are comparing it to the libraries listed below
Sorting:
- Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data …☆11,346Jan 13, 2026Updated last month
- Easily compute clip embeddings and build a clip retrieval system with them☆2,730Aug 15, 2025Updated 6 months ago
- Refine high-quality datasets and visual AI models☆10,410Updated this week
- Automatically find issues in image datasets and practice data-centric computer vision.☆1,158Jan 8, 2026Updated last month
- A python library for self-supervised learning on images.☆3,690Feb 23, 2026Updated last week
- Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.☆4,371Oct 19, 2025Updated 4 months ago
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML va…☆3,982Dec 28, 2025Updated 2 months ago
- Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.☆5,007Feb 24, 2026Updated last week
- Framework agnostic sliced/tiled inference + interactive ui + error analysis plots☆5,137Feb 12, 2026Updated 2 weeks ago
- FFCV: Fast Forward Computer Vision (and other ML workloads!)☆2,985Jun 16, 2024Updated last year
- An open source implementation of CLIP.☆13,430Updated this week
- An Agnostic Computer Vision Framework - Pluggable to any Training Library: Fastai, Pytorch-Lightning with more to come☆870Nov 26, 2024Updated last year
- Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".☆1,999Mar 21, 2024Updated last year
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.☆3,295Mar 3, 2024Updated 2 years ago
- Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125☆15,268Jun 25, 2025Updated 8 months ago
- 🐍 Geometric Computer Vision Library for Spatial AI☆11,093Feb 23, 2026Updated last week
- The WeightWatcher tool for predicting the accuracy of Deep Neural Networks☆1,724Updated this week
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…☆36,397Feb 23, 2026Updated last week
- Sparsity-aware deep learning inference runtime for CPUs☆3,159Jun 2, 2025Updated 9 months ago
- PyTorch code and models for the DINOv2 self-supervised learning method.☆12,427Feb 24, 2026Updated last week
- A data augmentations library for audio, image, text, and video.☆5,071Feb 13, 2026Updated 2 weeks ago
- Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.☆6,017Updated this week
- Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and …☆17,409Sep 5, 2024Updated last year
- Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.☆3,368May 19, 2025Updated 9 months ago
- Segment Anything in High Quality [NeurIPS 2023]☆4,179Sep 12, 2025Updated 5 months ago
- Images to inference with no labeling (use foundation models to train supervised models).☆2,634May 14, 2025Updated 9 months ago
- LAVIS - A One-stop Library for Language-Vision Intelligence☆11,167Nov 18, 2024Updated last year
- 🦘 Explore multimedia datasets at scale☆1,062Dec 7, 2024Updated last year
- Automatically create Faiss knn indices with the most optimal similarity search parameters.☆897Nov 4, 2025Updated 3 months ago
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…☆53,497Sep 18, 2024Updated last year
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…☆4,706Jan 12, 2026Updated last month
- Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stre…☆9,012Feb 16, 2026Updated 2 weeks ago
- Code release for "Cut and Learn for Unsupervised Object Detection and Instance Segmentation" and "VideoCutLER: Surprisingly Simple Unsupe…☆1,059Jun 4, 2025Updated 8 months ago
- Grounded Language-Image Pre-training☆2,572Jan 24, 2024Updated 2 years ago
- Lightweight Python library for adding real-time multi-object tracking to any detector.☆2,618Apr 30, 2025Updated 10 months ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆22,033Jan 23, 2026Updated last month
- Your PyTorch AI Factory - Flash enables you to easily configure and run complex AI recipes for over 15 tasks across 7 data domains☆1,731Oct 8, 2023Updated 2 years ago
- Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)☆939Nov 7, 2023Updated 2 years ago
- Containers for machine learning☆9,252Updated this week