Photoroom / dataroom
Framework based on a vector database to store, manage and curate large image datasets
☆80 Updated 3 months ago
Alternatives and similar repositories for dataroom
Users who are interested in dataroom are comparing it to the libraries listed below
- A natively parallel dataloader for Python, written in Rust. Serving data at GB/s speeds, while covering aspect ratio bucketing, crop and … ☆123 Updated this week
- Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and ML‑guided tuning. ☆45 Updated this week
- Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, … ☆69 Updated 7 months ago
- Recaption large (Web)Datasets with vllm and save the artifacts. ☆52 Updated last year
- ☆59 Updated last year
- A Gradio component that can be used to annotate images with bounding boxes. ☆64 Updated last month
- Production-ready data processing made easy and shareable ☆356 Updated last year
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets. ☆160 Updated last year
- ☆213 Updated this week
- Notebooks to demonstrate TimmWrapper ☆16 Updated 11 months ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models) ☆34 Updated 9 months ago
- Notebooks for fine tuning pali gemma ☆117 Updated 8 months ago
- Unified storage framework for the entire machine learning lifecycle ☆155 Updated last year
- Timm model explorer ☆42 Updated last year
- Fast, Modern, and Low Precision PyTorch Optimizers ☆116 Updated 3 months ago
- ☆29 Updated 2 years ago
- ☆91 Updated last year
- Faster generation with text-to-image diffusion models. ☆231 Updated 5 months ago
- Supercharge Your PyTorch Image Models: Bag of Tricks to 8x Faster Inference with ONNX Runtime & Optimizations ☆23 Updated last year
- ☆288 Updated last week
- Focused on fast experimentation and simplicity ☆76 Updated 11 months ago
- Simple python template ☆42 Updated last year
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te… ☆44 Updated last year
- Multi-backend recommender systems with Keras 3 ☆149 Updated this week
- High-throughput tensor loading for PyTorch ☆211 Updated last week
- Official implementation of "Active Image Indexing" ☆59 Updated 2 years ago
- Pruna is a model optimization framework built for developers, enabling you to deliver faster, more efficient models with minimal overhead… ☆1,045 Updated last week
- Lightning HPO & Training Studio App ☆19 Updated 2 years ago
- CHARacter-awaRE Diffusion: Multilingual Character-Aware Encoders for Font-Aware Diffusers That Can Actually Spell ☆14 Updated 2 years ago
- ☆16 Updated last year