MCG-NJU / FlowDCNLinks
[NeurIPS 2024] Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution
☆33Updated 7 months ago
Alternatives and similar repositories for FlowDCN
Users that are interested in FlowDCN are comparing it to the libraries listed below
Sorting:
- [ICML 2025] Differentiable Solver Search for Fast Diffusion Sampling☆21Updated last month
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]☆81Updated 8 months ago
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆69Updated 9 months ago
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆105Updated 4 months ago
- ☆133Updated last year
- ☆65Updated last week
- HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation☆63Updated 5 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Updated last year
- The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"☆53Updated 4 months ago
- [ICLR 2025][arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization☆165Updated last year
- AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model☆41Updated last month
- [CVPR 2025] HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation☆34Updated last month
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆83Updated 6 months ago
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"☆45Updated last year
- The official implementation of "[MASK] is All You Need"☆122Updated 2 weeks ago
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆38Updated 6 months ago
- ☆30Updated last year
- [ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model☆32Updated 8 months ago
- Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation☆65Updated 3 weeks ago
- Dimple, the first Discrete Diffusion Multimodal Large Language Model☆89Updated last month
- [ICCV 2025] HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets☆45Updated last week
- Test-Time Training on Video Streams☆64Updated 2 years ago
- [ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality…☆51Updated 4 months ago
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆39Updated 5 months ago
- (SRA) No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves☆80Updated 2 weeks ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆28Updated last year
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆86Updated 10 months ago
- [CVPR-22] This is the official implementation of the paper "Adavit: Adaptive vision transformers for efficient image recognition".☆55Updated 2 years ago
- Autoregressive Image Generation with Randomized Parallel Decoding☆70Updated 4 months ago
- Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023☆88Updated last year