glami / glami-1mLinks
The largest multilingual image-text classification dataset. It contains fashion products.
☆72Updated 2 years ago
Alternatives and similar repositories for glami-1m
Users that are interested in glami-1m are comparing it to the libraries listed below
Sorting:
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆158Updated last year
- ☆87Updated last year
- ☆103Updated last year
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont…☆63Updated 2 months ago
- A library for squeakily cleaning and filtering language datasets.☆47Updated last year
- Simply, faster, sentence-transformers☆143Updated 10 months ago
- **ARCHIVED** Filesystem interface to 🤗 Hub☆58Updated 2 years ago
- Efficiently read embedding in streaming from any filesystem☆100Updated last year
- GPU controlled Hetzner Cloud workers swarm for Crawling@Home project☆54Updated 2 years ago
- Index of URLs to pdf files all over the internet and scripts☆24Updated 2 years ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 10 months ago
- ☆23Updated last year
- Simple python template☆41Updated last year
- 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch☆50Updated 2 years ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch☆39Updated 3 years ago
- Tools for content datamining and NLP at scale☆43Updated last year
- Load any clip model with a standardized interface☆21Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M d…☆202Updated 9 months ago
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated 2 years ago
- ☆63Updated 9 months ago
- Efficient few-shot learning with cross-encoders.☆53Updated last year
- Application for searching images from natural language queries☆46Updated 3 years ago
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.☆50Updated 2 years ago
- A dashboard for exploring timm learning rate schedulers☆19Updated 7 months ago
- Python client for Marqo☆31Updated last week
- CHARacter-awaRE Diffusion: Multilingual Character-Aware Encoders for Font-Aware Diffusers That Can Actually Spell☆14Updated 2 years ago
- ☆58Updated last year
- Get hundred of million of image+url from the crawling at home dataset and preprocess them☆220Updated last year
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆115Updated 2 years ago