xie-lab-ml / Meissonic-InferenceLinks
Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer
☆16Updated last year
Alternatives and similar repositories for Meissonic-Inference
Users that are interested in Meissonic-Inference are comparing it to the libraries listed below
Sorting:
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆108Updated 4 months ago
- Code for CVPR 2024 Oral "Neural Lineage"☆17Updated last year
- [NIPS 2025 DB Oral] Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing☆140Updated last week
- Evaluation codes and data for GenEval2☆55Updated last month
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"☆47Updated last year
- [ICLR2026] The official code of "Weak-to-Strong Diffusion with Reflection".☆55Updated last week
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection☆55Updated 5 months ago
- Dimple, the first Discrete Diffusion Multimodal Large Language Model☆114Updated 7 months ago
- a collection of awesome autoregressive visual generation models☆79Updated 9 months ago
- [ICLR 2024] Official pytorch implementation of "Denoising Task Routing for Diffusion Models"☆24Updated last year
- Training Autoregressive Image Generation models via Reinforcement Learning☆50Updated 2 months ago
- [ICLR 2025, AAAI 2026] official implementation of "Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generati…☆34Updated 2 weeks ago
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"☆172Updated last month
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆128Updated last year
- Adapting LLaMA Decoder to Vision Transformer☆30Updated last year
- [ICLR 2026] Autoregressive Image Generation with Randomized Parallel Decoding☆86Updated 2 weeks ago
- ☆29Updated 10 months ago
- [ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆186Updated 8 months ago
- Official Implementation (Pytorch) of the "Representation Shift: Unifying Token Compression with FlashAttention", ICCV 2025☆30Updated 6 months ago
- Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations"☆43Updated 10 months ago
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆41Updated 11 months ago
- T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation☆36Updated 4 months ago
- GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning☆103Updated 2 weeks ago
- [CVPR 2025 (Oral)] Open implementation of "RandAR"☆206Updated 6 months ago
- Unified layout planning and image generation, ICCV2025☆40Updated 3 weeks ago
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models☆27Updated last year
- A PyTorch implementation of the paper "MMoT: Mixture-of-Modality-Tokens Transformer for Composed Multimodal Conditional Image Synthesis".☆12Updated 3 years ago
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆89Updated last year
- An Empirical Study of GPT-4o Image Generation Capabilities☆29Updated 9 months ago
- The official pytorch implementation of “Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization”.☆19Updated 8 months ago