MCG-NJU / FlowDCN
[NeurIPS 2024] Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution
☆31 Updated 4 months ago
Alternatives and similar repositories for FlowDCN:
Users who are interested in FlowDCN are comparing it to the libraries listed below:
- ☆27 Updated last year
- A big_vision inspired repo that implements a generic Auto-Encoder class capable of representation learning and generative modeling. ☆35 Updated 10 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024] ☆81 Updated 5 months ago
- Learning 1D Causal Visual Representation with De-focus Attention Networks ☆34 Updated 10 months ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones". ☆27 Updated last year
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling ☆31 Updated 2 months ago
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient ☆97 Updated last month
- ☆52 Updated 2 years ago
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality ☆31 Updated 5 months ago
- ☆61 Updated last year
- Teach-DETR: Better Training DETR with Teachers ☆31 Updated last year
- [CVPR 2025] Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training ☆39 Updated last month
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation ☆37 Updated last year
- ☆45 Updated last year
- Denoising Diffusion Step-aware Models (ICLR2024) ☆60 Updated last year
- Codebase for the paper "Elucidating the design space of language models for image generation" ☆45 Updated 5 months ago
- This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality" ☆46 Updated last month
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers ☆28 Updated 2 years ago
- ☆72 Updated last month
- Adapting LLaMA Decoder to Vision Transformer ☆28 Updated 11 months ago
- [ICCV 2023] On the Effectiveness of Spectral Discriminators for Perceptual Quality Improvement ☆65 Updated last year
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective ☆68 Updated 6 months ago
- [ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model ☆29 Updated 5 months ago
- [CVPR2025] Breaking the Low-Rank Dilemma of Linear Attention ☆17 Updated last month
- ☆15 Updated 2 months ago
- Official implementation of LaVin-DiT ☆32 Updated 3 months ago
- HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation ☆57 Updated 2 months ago
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment" ☆31 Updated 2 months ago
- The official implementation of ADDP (ICLR 2024) ☆12 Updated last year
- Official PyTorch Code for "Is Synthetic Data From Diffusion Models Ready for Knowledge Distillation?" (https://arxiv.org/abs/2305.12954) ☆46 Updated last year