rom1504 / embedding-reader
Efficiently read embedding in streaming from any filesystem
☆99Updated 11 months ago
Alternatives and similar repositories for embedding-reader:
Users that are interested in embedding-reader are comparing it to the libraries listed below
- Simple python template☆41Updated 11 months ago
- Get hundred of million of image+url from the crawling at home dataset and preprocess them☆218Updated 10 months ago
- Simple large-scale training of stable diffusion with multi-node support.☆131Updated last year
- CLOOB training (JAX) and inference (JAX and PyTorch)☆71Updated 2 years ago
- JAX implementation ViT-VQGAN☆82Updated 2 years ago
- ☆103Updated last year
- ☆64Updated last year
- Using pretrained encoder and language models to generate captions from multimedia inputs.☆96Updated 2 years ago
- A repository containing datasets and tools to train a watermark classifier.☆66Updated 2 years ago
- ☆111Updated 3 years ago
- Aim for the moon. If you miss, you may hit a star.☆164Updated 2 years ago
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...☆317Updated last year
- Official implementation of "Active Image Indexing"☆59Updated 2 years ago
- Finetune glide-text2im from openai on your own data.☆89Updated 2 years ago
- Let's make a video clip☆93Updated 2 years ago
- Aggregating embeddings over time☆31Updated 2 years ago
- ☆157Updated 2 years ago
- Easily compute clip embeddings from video frames☆144Updated last year
- M4 experiment logbook☆57Updated last year
- Minimal Differentiable Image Reward Functions☆52Updated last month
- Implementation of the video diffusion model and training scheme presented in the paper, Flexible Diffusion Modeling of Long Videos, in Py…☆84Updated 2 years ago
- Implementation of the deepmind Flamingo vision-language model, based on Hugging Face language models and ready for training☆166Updated last year
- ☆114Updated 2 years ago
- Open reproduction of MUSE for fast text2image generation.☆348Updated 10 months ago
- Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M d…☆200Updated 7 months ago
- Tools for content datamining and NLP at scale☆43Updated 9 months ago
- Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.☆60Updated 3 years ago
- Script and models for clustering LAION-400m CLIP embeddings.☆26Updated 3 years ago
- ☆30Updated 3 years ago
- Contrastive Language-Image Pretraining☆142Updated 2 years ago