patil-suraj / vit-vqgan
JAX implementation of ViT-VQGAN
☆82 Updated 2 years ago
Alternatives and similar repositories for vit-vqgan:
Users who are interested in vit-vqgan are comparing it to the libraries listed below.
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch ☆64 Updated 2 years ago
- CLOOB training (JAX) and inference (JAX and PyTorch) ☆70 Updated 2 years ago
- Un-*** 50 billions multimodality dataset ☆24 Updated 2 years ago
- Implementation of the video diffusion model and training scheme presented in the paper, Flexible Diffusion Modeling of Long Videos, in Py… ☆84 Updated 2 years ago
- An implementation of simple diffusion in PyTorch (and JAX) ☆35 Updated 2 years ago
- FID computation in Jax/Flax. ☆27 Updated 7 months ago
- ☆28 Updated 3 years ago
- Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI ☆86 Updated 3 years ago
- ☆53 Updated 2 years ago
- Simple large-scale training of stable diffusion with multi-node support. ☆128 Updated last year
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch ☆89 Updated 3 years ago
- ☆51 Updated last year
- Official implementation of "Is This Loss Informative? Faster Text-to-Image Customization by Tracking Objective Dynamics" (NeurIPS 2023) ☆37 Updated last year
- A new play-and-plug method of controlling an existing generative model with conditioning attributes and their compositions. ☆72 Updated 3 years ago
- Simple python template ☆40 Updated 10 months ago
- ☆72 Updated last year
- ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis ☆124 Updated 2 years ago
- The implementation for Accelerating Guided Diffusion Sampling with Splitting Numerical Methods (2023) ☆48 Updated last year
- Aggregating embeddings over time ☆31 Updated 2 years ago
- Finetune glide-text2im from openai on your own data. ☆89 Updated 2 years ago
- Official code implementation for our paper -- Direct Inversion: Optimization-Free Text-Driven Real Image Editing with Diffusion Models. ☆25 Updated 2 years ago
- An open source implementation of CLIP. ☆32 Updated 2 years ago
- Inverts CLIP text embeds to image embeds and visualizes with deep-image-prior. ☆35 Updated 2 years ago
- Finetune the 1.4B latent diffusion text2img-large checkpoint from CompVis using deepspeed. (work-in-progress) ☆36 Updated 2 years ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch ☆100 Updated last year
- Training simple models to predict CLIP image embeddings from text embeddings, and vice versa. ☆60 Updated 2 years ago
- Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in P… ☆201 Updated last year
- Evaluation benchmark for the task of Semantic Image Translation. Contains code to run FlexIT (CVPR 2022) ☆34 Updated 2 years ago
- CLOOB Conditioned Latent Diffusion training and inference code ☆112 Updated 2 years ago
- 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch ☆50 Updated 2 years ago