patil-suraj / vit-vqgan
JAX implementation of ViT-VQGAN
☆77 Updated 2 years ago
Related projects
Alternatives and complementary repositories for vit-vqgan
- CLOOB training (JAX) and inference (JAX and PyTorch) ☆70 Updated 2 years ago
- Un-*** 50 billions multimodality dataset ☆24 Updated 2 years ago
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch ☆64 Updated 2 years ago
- ☆28 Updated 2 years ago
- Implementation of the video diffusion model and training scheme presented in the paper, Flexible Diffusion Modeling of Long Videos, in Py… ☆84 Updated 2 years ago
- An implementation of simple diffusion in PyTorch (and JAX) ☆35 Updated last year
- Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI ☆84 Updated 2 years ago
- An open source implementation of CLIP. ☆32 Updated 2 years ago
- ☆50 Updated 10 months ago
- Simple python template ☆40 Updated 6 months ago
- FID computation in Jax/Flax. ☆24 Updated 4 months ago
- ☆27 Updated 2 weeks ago
- Another attempt at a long-context / efficient transformer by me ☆37 Updated 2 years ago
- Implementation of LogAvgExp for Pytorch ☆32 Updated 2 years ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch ☆97 Updated last year
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling. ☆30 Updated 4 months ago
- A JAX implementation of the continuous time formulation of Consistency Models ☆83 Updated last year
- OpenAI CLIP based image generator with complex config file controlled transformation and training pipelines ☆18 Updated 2 years ago
- Implementation of Discrete Key / Value Bottleneck, in Pytorch ☆87 Updated last year
- Implementation of Transframer, Deepmind's U-net + Transformer architecture for up to 30 seconds video generation, in Pytorch ☆68 Updated 2 years ago
- ☆26 Updated 6 months ago
- Refactoring dalle-pytorch and taming-transformers for TPU VM ☆60 Updated 3 years ago
- Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in P… ☆194 Updated 9 months ago
- Official repository for MaGNET, ICLR 2022 ☆26 Updated 2 years ago
- Inverts CLIP text embeds to image embeds and visualizes with deep-image-prior. ☆35 Updated 2 years ago
- codebase for the SIMAT dataset and evaluation ☆38 Updated 2 years ago
- ☆21 Updated 5 months ago
- ☆71 Updated last year