glami / glami-1m
The largest multilingual image-text classification dataset. It contains fashion products.
☆75, updated 2 years ago
Alternatives and similar repositories for glami-1m
Users who are interested in glami-1m compare it to the libraries listed below.
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont… ☆68, updated 2 months ago
- Efficiently read embeddings in streaming from any filesystem ☆103, updated 4 months ago
- Input text or image, get back matching image fashion results, using Jina, DocArray, and CLIP ☆49, updated 3 years ago
- Semantic search with embeddings: index anything ☆140, updated 3 years ago
- Load any clip model with a standardized interface ☆22, updated last month
- CHARacter-awaRE Diffusion: Multilingual Character-Aware Encoders for Font-Aware Diffusers That Can Actually Spell ☆14, updated 2 years ago
- LoRA fine-tuned Stable Diffusion Deployment ☆31, updated 2 years ago
- Official code of paper: MovingFashion: a Benchmark for the Video-to-Shop Challenge ☆45, updated last year
- Python package to generate image embeddings with CLIP without PyTorch/TensorFlow ☆157, updated 3 years ago
- GPU controlled Hetzner Cloud workers swarm for Crawling@Home project ☆58, updated 3 years ago
- This is the official repository for the paper "OpenFashionCLIP: Vision-and-Language Contrastive Learning with Open-Source Fashion Data". … ☆71, updated last year
- Simply, faster, sentence-transformers ☆143, updated last year
- ☆87, updated last year
- Official implementation of "Active Image Indexing" ☆59, updated 2 years ago
- ☆103, updated last year
- Jupyter Notebooks for experimenting with negative prompting with Stable Diffusion 2.0. ☆87, updated 3 years ago
- State-of-the-art CLIP/SigLIP embedding models finetuned for the fashion domain. +57% increase in evaluation metrics vs FashionCLIP 2.0. ☆118, updated last year
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te… ☆44, updated last year
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets. ☆160, updated last year
- Get hundreds of millions of image+url pairs from the crawling at home dataset and preprocess them ☆223, updated last year
- MultiOCR, an interface that connects multiple open-source OCR and various Cloud OCR. ☆31, updated 2 years ago
- ☆28, updated 2 years ago
- Simple python template ☆42, updated last year
- 🤗 Disaggregators: Curated data labelers for in-depth analysis. ☆67, updated 2 years ago
- ☆13, updated 3 years ago
- [WIP] A 🔥 interface for running code in the cloud ☆86, updated 2 years ago
- Index of URLs to pdf files all over the internet and scripts ☆25, updated 2 years ago
- CLIP (Contrastive Language–Image Pre-training) for Italian ☆185, updated 2 years ago
- H&M Fashion Image similarity search with Weaviate and DocArray ☆43, updated last year
- ☆65, updated 2 years ago