LTH14 / mage
A PyTorch implementation of MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis
☆551Updated last year
Alternatives and similar repositories for mage:
Users that are interested in mage are comparing it to the libraries listed below
- Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)☆547Updated 9 months ago
- PyTorch implementation of RCG https://arxiv.org/abs/2312.03701☆902Updated 4 months ago
- Diffusion Classifier leverages pretrained diffusion models to perform zero-shot classification without additional training☆438Updated 11 months ago
- Label-Efficient Semantic Segmentation with Diffusion Models (ICLR'2022)☆682Updated last year
- ConvMAE: Masked Convolution Meets Masked Autoencoders☆491Updated last year
- [ICCV 2023] A latent space for stochastic diffusion models☆599Updated last year
- A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".☆966Updated last year
- Official Open Source code for "Scaling Language-Image Pre-training via Masking"☆413Updated last year
- Pytorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)☆422Updated last year
- This repository categorizes the papers about diffusion models applied in computer vision according to their target task. The classifcatio…☆392Updated last year
- [CVPR 2022] Official PyTorch Implementation for DiffusionCLIP: Text-guided Image Manipulation Using Diffusion Models☆824Updated last year
- [ICLR 2023 Oral] Image as Set of Points☆553Updated 9 months ago
- MetaFormer Baselines for Vision (TPAMI 2024)☆443Updated 8 months ago
- Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.☆749Updated 2 years ago
- Reading list for research topics in Masked Image Modeling☆331Updated 2 months ago
- Official Jax Implementation of MaskGIT☆488Updated 2 years ago
- [ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation☆487Updated 3 months ago
- ☆453Updated 2 years ago
- Pytorch implementation of VQGAN (Taming Transformers for High-Resolution Image Synthesis) (https://arxiv.org/pdf/2012.09841.pdf)☆491Updated 7 months ago
- [CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting☆526Updated last year
- Official Implementation of Semantic Image Synthesis via Diffusion Models☆239Updated 2 years ago
- Dual Diffusion Implicit Bridges for Image-to-Image Translation. ICLR 2023.☆374Updated 2 years ago
- Official PyTorch implementation of Video Probabilistic Diffusion Models in Projected Latent Space (CVPR 2023).☆314Updated 9 months ago
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".☆950Updated 2 years ago
- An unofficial implementation of both ViT-VQGAN and RQ-VAE in Pytorch☆296Updated last year
- iBOT : Image BERT Pre-Training with Online Tokenizer (ICLR 2022)☆705Updated 2 years ago
- A simple way to keep track of an Exponential Moving Average (EMA) version of your Pytorch model☆559Updated 2 months ago
- Official implementation and data release of the paper "Visual Prompting via Image Inpainting".☆305Updated last year
- Unofficial PyTorch implementation of Denoising Diffusion Probabilistic Models☆541Updated 8 months ago
- MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022☆562Updated 2 years ago