MCG-NJU / FlowDCN
[NeurIPS 2024] Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution
☆26Updated 3 weeks ago
Alternatives and similar repositories for FlowDCN:
Users that are interested in FlowDCN are comparing it to the libraries listed below
- ☆52Updated last year
- CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆75Updated last month
- [NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effect…☆35Updated 7 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]☆75Updated 2 months ago
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching☆91Updated 6 months ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆24Updated 11 months ago
- Adapting LLaMA Decoder to Vision Transformer☆26Updated 7 months ago
- Officail Repo of γ -MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models☆28Updated 2 months ago
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an…☆72Updated 3 weeks ago
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆32Updated 7 months ago
- Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding☆26Updated 2 months ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation☆40Updated last month
- [CVPR 2023] Zero-shot Generative Model Adaptation via Image-specific Prompt Learning☆83Updated last year
- This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality"☆43Updated this week
- Codebase for the paper-Elucidating the design space of language models for image generation☆45Updated 2 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆33Updated 6 months ago
- This is a repo to track the latest autoregressive visual generation papers.☆105Updated 2 weeks ago
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23.☆73Updated last year
- Teach-DETR: Better Training DETR with Teachers☆30Updated 10 months ago
- ☆44Updated 2 weeks ago
- ☆70Updated last month
- ☆57Updated last year
- ☆44Updated last year
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks".☆76Updated 4 months ago
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆80Updated 3 months ago
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆23Updated 2 months ago
- [NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".☆31Updated last month
- FQGAN: Factorized Visual Tokenization and Generation☆39Updated last week
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆71Updated 5 months ago
- Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch☆47Updated last month