markweberdev / maskbit
Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"
☆47Updated 2 weeks ago
Alternatives and similar repositories for maskbit:
Users that are interested in maskbit are comparing it to the libraries listed below
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an…☆88Updated last month
- "SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow", Yuanzhi Zhu, Xingchao Liu, Qiang Liu☆45Updated 2 months ago
- This is the official implementation for ControlVAR.☆94Updated 2 months ago
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models☆36Updated 4 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]☆78Updated 3 months ago
- ☆132Updated last week
- CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆75Updated 3 weeks ago
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆102Updated 4 months ago
- ☆43Updated 5 months ago
- ICCV2023-Diffusion-Papers☆109Updated last year
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆50Updated last week
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23.☆74Updated last year
- The official repository of "Spectral Motion Alignment for Video Motion Transfer using Diffusion Models".☆24Updated 2 months ago
- ☆138Updated 2 months ago
- Website source files for Diffusion2GAN Project.☆78Updated 4 months ago
- Training-Free Condition-Guided Text-to-Video Generation☆62Updated last year
- PyTorch implementation of InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following☆28Updated 3 weeks ago
- Denoising Diffusion Step-aware Models (ICLR2024)☆56Updated last year
- Image Neural Field Diffusion Models, CVPR 2024 (Highlight)☆55Updated 3 months ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆135Updated 8 months ago
- Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos☆49Updated 6 months ago
- [NeurIPS 2024] Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation☆58Updated 3 months ago
- [ECCV2024] Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models☆41Updated 7 months ago
- Open implementation of "RandAR"☆53Updated last month
- [ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing"☆84Updated 5 months ago
- FQGAN: Factorized Visual Tokenization and Generation☆42Updated last month
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"☆28Updated 11 months ago
- This is a PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Framework for Cross-Modality Evolu…☆130Updated last week
- [CVPR2024] Official PyTorch implementation of "Contrastive Denoising Score(CDS) for Text-guided Latent Diffusion Image Editing"☆106Updated 3 months ago