patil-suraj / vit-vqganView external linksLinks
JAX implementation ViT-VQGAN
☆82Sep 21, 2022Updated 3 years ago
Alternatives and similar repositories for vit-vqgan
Users that are interested in vit-vqgan are comparing it to the libraries listed below
Sorting:
- Minimal JAX/Flax port of `lpips` supporting `vgg16`, with pre-trained weights stored in the 🤗 Hugging Face hub.☆17Aug 1, 2022Updated 3 years ago
- An unofficial implementation of both ViT-VQGAN and RQ-VAE in Pytorch☆321Apr 7, 2025Updated 10 months ago
- JAX implementation of VQGAN☆91Jul 9, 2022Updated 3 years ago
- Train vision models using JAX and 🤗 transformers☆100Dec 14, 2025Updated 2 months ago
- Refactoring dalle-pytorch and taming-transformers for TPU VM☆60Aug 30, 2021Updated 4 years ago
- JAX implementation ViT-VQGAN☆63Jul 23, 2022Updated 3 years ago
- Implementing the Denoising Diffusion Probabilistic Model in Flax☆157Nov 1, 2022Updated 3 years ago
- Un-*** 50 billions multimodality dataset☆23Sep 14, 2022Updated 3 years ago
- An implementation of simple diffusion in PyTorch (and JAX)☆34Jan 28, 2023Updated 3 years ago
- ☆91Sep 19, 2022Updated 3 years ago
- Latent Diffusion Language Models☆70Sep 20, 2023Updated 2 years ago
- Official implementation of VQ-Diffusion☆977Apr 17, 2024Updated last year
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆80Jan 7, 2026Updated last month
- Fastai + PyTorch DDP in Jupyter Notebook☆27Sep 29, 2020Updated 5 years ago
- A concise but complete implementation of CLIP with various experimental improvements from recent papers☆722Oct 16, 2023Updated 2 years ago
- Optimized library for large-scale extraction of frames and audio from video.☆204Sep 11, 2023Updated 2 years ago
- Implementation of Discrete Key / Value Bottleneck, in Pytorch☆88Jul 9, 2023Updated 2 years ago
- A JAX implementation of the continuous time formulation of Consistency Models☆85Apr 7, 2023Updated 2 years ago
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena☆207Aug 26, 2023Updated 2 years ago
- Simple large-scale training of stable diffusion with multi-node support.☆133May 8, 2023Updated 2 years ago
- Pretrained deep learning models for Jax/Flax: StyleGAN2, GPT2, VGG, ResNet, etc.☆265Mar 21, 2025Updated 10 months ago
- Automatically take good care of your preemptible TPUs☆37May 15, 2023Updated 2 years ago
- Implementation of a holodeck, written in Pytorch☆18Nov 1, 2023Updated 2 years ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆90Oct 11, 2024Updated last year
- Code for the ECCV 2022 paper "Unleashing Transformers"☆185Apr 17, 2023Updated 2 years ago
- My NER Experiments with ModernBERT and Ettin☆26Jul 17, 2025Updated 7 months ago
- ☆21Mar 15, 2023Updated 2 years ago
- Object recognition with Pepper using a deep learning model☆10Sep 16, 2021Updated 4 years ago
- Directed masked autoencoders☆14Feb 5, 2026Updated last week
- Implementation of the video diffusion model and training scheme presented in the paper, Flexible Diffusion Modeling of Long Videos, in Py…☆85May 28, 2022Updated 3 years ago
- ECCV2022,Bootstrapped Masked Autoencoders for Vision BERT Pretraining☆97Nov 2, 2022Updated 3 years ago
- Image and video processing toolbox☆10Jun 12, 2020Updated 5 years ago
- Jax implementation of VIT-VQGAN☆10Jan 25, 2024Updated 2 years ago
- Course repository for the Spring 2023 COMP664 course "Deep Learning" at UNC☆14Apr 17, 2023Updated 2 years ago
- EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets☆10Dec 12, 2023Updated 2 years ago
- The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)☆997Jan 3, 2024Updated 2 years ago
- ☆24Jun 4, 2024Updated last year
- Diffusion Reading Group at EleutherAI☆335Aug 8, 2023Updated 2 years ago
- A CLI tool for using GLIDE to generate images from text.☆67May 5, 2022Updated 3 years ago