rom1504 / embedding-reader
Efficiently read embedding in streaming from any filesystem
☆97Updated 9 months ago
Alternatives and similar repositories for embedding-reader:
Users that are interested in embedding-reader are comparing it to the libraries listed below
- Simple python template☆40Updated 9 months ago
- CLOOB training (JAX) and inference (JAX and PyTorch)☆70Updated 2 years ago
- Get hundred of million of image+url from the crawling at home dataset and preprocess them☆215Updated 8 months ago
- Simple large-scale training of stable diffusion with multi-node support.☆127Updated last year
- Using pretrained encoder and language models to generate captions from multimedia inputs.☆94Updated last year
- ☆107Updated 2 years ago
- Iterable datapipelines for pytorch training.☆81Updated 5 months ago
- ☆101Updated last year
- ☆111Updated 3 years ago
- ☆64Updated last year
- Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.☆60Updated 2 years ago
- A repository containing datasets and tools to train a watermark classifier.☆64Updated 2 years ago
- ☆155Updated 2 years ago
- Jupyter Notebooks for experimenting with negative prompting with Stable Diffusion 2.0.☆88Updated 2 years ago
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...☆315Updated last year
- Easily compute clip embeddings from video frames☆140Updated last year
- Finetune glide-text2im from openai on your own data.☆88Updated 2 years ago
- Aim for the moon. If you miss, you may hit a star.☆162Updated last year
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆98Updated last year
- Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M d…☆193Updated 5 months ago
- ☆163Updated last year
- JAX implementation ViT-VQGAN☆80Updated 2 years ago
- Train vision models using JAX and 🤗 transformers☆95Updated this week
- 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch☆50Updated 2 years ago
- A CLI tool for using GLIDE to generate images from text.☆68Updated 2 years ago
- Finetune the 1.4B latent diffusion text2img-large checkpoint from CompVis using deepspeed. (work-in-progress)☆36Updated 2 years ago
- Description and pointers of laion datasets☆241Updated 2 years ago
- Let's make a video clip☆93Updated 2 years ago
- Release of ImageNet-Captions☆45Updated 2 years ago
- Official implementation of "Active Image Indexing"☆58Updated last year