rom1504 / embedding-readerLinks
Efficiently read embedding in streaming from any filesystem
☆98Updated last year
Alternatives and similar repositories for embedding-reader
Users that are interested in embedding-reader are comparing it to the libraries listed below
Sorting:
- Get hundred of million of image+url from the crawling at home dataset and preprocess them☆219Updated last year
- Simple python template☆41Updated last year
- Simple large-scale training of stable diffusion with multi-node support.☆131Updated 2 years ago
- CLOOB training (JAX) and inference (JAX and PyTorch)☆71Updated 3 years ago
- Using pretrained encoder and language models to generate captions from multimedia inputs.☆96Updated 2 years ago
- ☆103Updated last year
- ☆111Updated 3 years ago
- Let's make a video clip☆92Updated 2 years ago
- Easily compute clip embeddings from video frames☆145Updated last year
- ☆159Updated 2 years ago
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...☆316Updated last year
- ☆64Updated last year
- Aim for the moon. If you miss, you may hit a star.☆164Updated 2 years ago
- Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M d…☆202Updated 9 months ago
- Finetune glide-text2im from openai on your own data.☆89Updated 2 years ago
- [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"☆314Updated last year
- ☆165Updated 2 years ago
- Benchmarking Generative Models with Artworks☆228Updated 2 years ago
- ☆116Updated 2 years ago
- Aggregating embeddings over time☆31Updated 2 years ago
- Description and pointers of laion datasets☆245Updated 2 years ago
- Iterable datapipelines for pytorch training.☆82Updated 9 months ago
- Jupyter Notebooks for experimenting with negative prompting with Stable Diffusion 2.0.☆86Updated 2 years ago
- GPU controlled Hetzner Cloud workers swarm for Crawling@Home project☆53Updated 2 years ago
- Feed forward VQGAN-CLIP model, where the goal is to eliminate the need for optimizing the latent space of VQGAN for each input prompt☆137Updated last year
- Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.☆60Updated 3 years ago
- Release of ImageNet-Captions☆48Updated 2 years ago
- cheap views of intermediate Stable Diffusion results☆46Updated 2 years ago
- Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.☆391Updated 2 years ago
- CLOOB Conditioned Latent Diffusion training and inference code☆112Updated 3 years ago