robvanvolt / DALLE-tools
DALLE-tools provides useful dataset utilities to improve your workflow with WebDatasets.
☆15 Updated 2 years ago
Alternatives and similar repositories for DALLE-tools:
Users who are interested in DALLE-tools are comparing it to the libraries listed below.
- Describe the format of image/text datasets ☆11 Updated 2 years ago
- ☆15 Updated 2 years ago
- Implementation of a holodeck, written in PyTorch ☆17 Updated last year
- Load any CLIP model with a standardized interface ☆21 Updated 10 months ago
- Utilities for PyTorch distributed ☆23 Updated this week
- CHARacter-awaRE Diffusion: Multilingual Character-Aware Encoders for Font-Aware Diffusers That Can Actually Spell ☆14 Updated last year
- Implementation of Metaformer, but in an autoregressive manner ☆23 Updated 2 years ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data ☆21 Updated 7 months ago
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator ☆31 Updated last year
- Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP ☆39 Updated 2 years ago
- Simple script to re-rank images using OpenAI's CLIP (https://github.com/openai/CLIP). ☆15 Updated 3 years ago
- ☆17 Updated last year
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & its caption derived f… ☆15 Updated 3 years ago
- A dashboard for exploring timm learning rate schedulers ☆19 Updated 3 months ago
- Finetune the 1.4B latent diffusion text2img-large checkpoint from CompVis using deepspeed. (work-in-progress) ☆36 Updated 2 years ago
- Generate images from texts. In Russian ☆19 Updated 3 years ago
- A minimal TPU compatible JAX implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis. ☆13 Updated 2 years ago
- Floral Diffusion is a custom diffusion model trained by jags using a DD 5.6 version ☆26 Updated 2 years ago
- Script and models for clustering LAION-400m CLIP embeddings. ☆25 Updated 3 years ago
- Visionner turns raw image data into NumPy arrays, which are more suitable for deep learning tasks ☆10 Updated last year
- ☆20 Updated 3 years ago
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data. ☆17 Updated 4 months ago
- Official repository for MaGNET, ICLR 2022 ☆24 Updated 2 years ago
- ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch. ☆56 Updated 2 years ago
- ☆32 Updated 4 months ago
- ☆21 Updated 2 months ago
- The original weights of some Caffe models, ported to PyTorch. ☆11 Updated 3 years ago
- Visual search interface ☆11 Updated 3 years ago
- Training simple models to predict CLIP image embeddings from text embeddings, and vice versa. ☆60 Updated 2 years ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆23 Updated 3 weeks ago