Photoroom / dataroom
Framework based on a vector database to store, manage, and curate large image datasets
☆80 Updated 2 months ago
Alternatives and similar repositories for dataroom
Users who are interested in dataroom are comparing it to the libraries listed below
- A natively parallel dataloader for Python, written in Rust. Serving data at GB/s speeds, while covering aspect ratio bucketing, crop and … ☆119 Updated 3 weeks ago
- Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and ML‑guided tuning. ☆43 Updated this week
- Recaption large (Web)Datasets with vllm and save the artifacts. ☆52 Updated 11 months ago
- Writing FLUX in Triton ☆41 Updated last year
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models) ☆34 Updated 8 months ago
- ☆91 Updated last year
- A Gradio component that can be used to annotate images with bounding boxes. ☆63 Updated 3 weeks ago
- Train vision models using JAX and 🤗 transformers ☆100 Updated 2 weeks ago
- ☆59 Updated last year
- Timm model explorer ☆42 Updated last year
- The Generative Landscape - a course on generative modelling (currently unfinished). Join our Discord: https://discord.gg/vSjhr8xb4g ☆147 Updated 2 years ago
- Production-ready data processing made easy and shareable ☆353 Updated last year
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets. ☆159 Updated last year
- ☆211 Updated last week
- High-throughput tensor loading for PyTorch ☆197 Updated last week
- Unified storage framework for the entire machine learning lifecycle ☆155 Updated last year
- An implementation of the Llama architecture, to instruct and delight ☆21 Updated 5 months ago
- ☆86 Updated 4 months ago
- ☆16 Updated last year
- ☆29 Updated 4 months ago
- Train fastai models faster (and other useful tools) ☆72 Updated 5 months ago
- Focused on fast experimentation and simplicity ☆75 Updated 10 months ago
- A miniature version of Modal ☆21 Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data ☆22 Updated last year
- ☆20 Updated last year
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new… ☆123 Updated last year
- Fast, Modern, and Low Precision PyTorch Optimizers ☆116 Updated 2 months ago
- Scalable and Performant Data Loading ☆335 Updated this week
- Official implementation of "Active Image Indexing" ☆59 Updated 2 years ago
- Efficiently read embedding in streaming from any filesystem ☆102 Updated 3 months ago