glami / glami-1m
The largest multilingual image-text classification dataset. It contains fashion products.
☆75 Updated 2 years ago
Alternatives and similar repositories for glami-1m
Users who are interested in glami-1m are comparing it to the repositories listed below
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont… ☆70 Updated last month
- Efficiently read embeddings in streaming fashion from any filesystem ☆104 Updated 5 months ago
- Semantic search with embeddings: index anything ☆140 Updated 3 years ago
- Input text or image, get back matching image fashion results, using Jina, DocArray, and CLIP ☆49 Updated 3 years ago
- CHARacter-awaRE Diffusion: Multilingual Character-Aware Encoders for Font-Aware Diffusers That Can Actually Spell ☆14 Updated 2 years ago
- ☆87 Updated 2 years ago
- Tools for content datamining and NLP at scale ☆44 Updated last year
- State-of-the-art CLIP/SigLIP embedding models finetuned for the fashion domain. +57% increase in evaluation metrics vs FashionCLIP 2.0. ☆121 Updated last year
- Official code of paper: MovingFashion: a Benchmark for the Video-to-Shop Challenge ☆45 Updated last year
- Jupyter Notebooks for experimenting with negative prompting with Stable Diffusion 2.0. ☆87 Updated 3 years ago
- This is the official repository for the paper "OpenFashionCLIP: Vision-and-Language Contrastive Learning with Open-Source Fashion Data". … ☆71 Updated last year
- LoRA fine-tuned Stable Diffusion Deployment ☆31 Updated 2 years ago
- ☆103 Updated 2 years ago
- Load any clip model with a standardized interface ☆22 Updated 3 months ago
- Official implementation of "Active Image Indexing" ☆60 Updated 2 years ago
- Python package to generate image embeddings with CLIP without PyTorch/TensorFlow ☆158 Updated 3 years ago
- H&M Fashion Image similarity search with Weaviate and DocArray ☆43 Updated last year
- GPU controlled Hetzner Cloud workers swarm for Crawling@Home project ☆58 Updated 3 years ago
- Get hundreds of millions of image+url pairs from the Crawling@Home dataset and preprocess them ☆223 Updated last year
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets. ☆160 Updated last year
- 🤗 Disaggregators: Curated data labelers for in-depth analysis. ☆67 Updated 2 years ago
- Using pretrained encoder and language models to generate captions from multimedia inputs. ☆100 Updated 2 years ago
- Simple python template ☆42 Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… ☆35 Updated 2 years ago
- ☆23 Updated last year
- 🤝 Trade any tensors over the network ☆30 Updated 2 years ago
- ☆28 Updated 2 years ago
- MultiOCR, an interface that connects multiple open-source OCR and various Cloud OCR. ☆32 Updated 2 years ago
- **ARCHIVED** Filesystem interface to 🤗 Hub ☆58 Updated 2 years ago
- ☆13 Updated 3 years ago