LTH14 / mage
A PyTorch implementation of MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis
☆557Updated 2 years ago
Alternatives and similar repositories for mage:
Users that are interested in mage are comparing it to the libraries listed below
- PyTorch implementation of RCG https://arxiv.org/abs/2312.03701☆911Updated 6 months ago
- Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)☆557Updated 11 months ago
- Label-Efficient Semantic Segmentation with Diffusion Models (ICLR'2022)☆696Updated 2 years ago
- ConvMAE: Masked Convolution Meets Masked Autoencoders☆501Updated 2 years ago
- Diffusion Classifier leverages pretrained diffusion models to perform zero-shot classification without additional training☆449Updated last year
- A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".☆995Updated 2 years ago
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".☆968Updated 2 years ago
- Official Open Source code for "Scaling Language-Image Pre-training via Masking"☆420Updated 2 years ago
- Pytorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)☆435Updated last year
- Official Jax Implementation of MaskGIT☆503Updated 2 years ago
- An unofficial implementation of both ViT-VQGAN and RQ-VAE in Pytorch☆300Updated last week
- A simple way to keep track of an Exponential Moving Average (EMA) version of your Pytorch model☆569Updated 4 months ago
- ☆459Updated 2 years ago
- [ICCV 2023] A latent space for stochastic diffusion models☆620Updated last year
- Official Implementation of Rectified Flow (ICLR2023 Spotlight)☆1,197Updated 8 months ago
- [ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation☆490Updated 5 months ago
- [ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"☆315Updated 3 months ago
- This repository categorizes the papers about diffusion models applied in computer vision according to their target task. The classifcatio…☆394Updated last year
- PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838☆1,467Updated 6 months ago
- [CVPR'23] Video Probabilistic Diffusion Models in Projected Latent Space☆315Updated 11 months ago
- Reading list for research topics in Masked Image Modeling☆332Updated 4 months ago
- Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.☆757Updated 2 years ago
- [ICLR 2023 Oral] Image as Set of Points☆564Updated 11 months ago
- [CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting☆532Updated last year
- MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022☆570Updated 2 years ago
- Official implementation and data release of the paper "Visual Prompting via Image Inpainting".☆309Updated last year
- Official Implementation of SinDiffusion: Learning a Diffusion Model from a Single Natural Image☆294Updated 2 years ago
- ❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119☆1,102Updated last year
- MetaFormer Baselines for Vision (TPAMI 2024)☆457Updated 10 months ago
- A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He el al.).☆816Updated 9 months ago