LeapLabTHU / ImprovedNAT
A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"
☆40Updated 7 months ago
Alternatives and similar repositories for ImprovedNAT:
Users that are interested in ImprovedNAT are comparing it to the libraries listed below
- [ECCV 2024] AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation☆32Updated 4 months ago
- [NeurIPS 2024] ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis☆22Updated last month
- Open implementation of "RandAR"☆48Updated this week
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆23Updated 2 months ago
- Codebase for the paper-Elucidating the design space of language models for image generation☆45Updated 2 months ago
- Official implementation of Dynamic Perceiver☆42Updated last year
- Official repository of InLine attention (NeurIPS 2024)☆35Updated 3 weeks ago
- Liquid: Language Models are Scalable Multi-modal Generators☆60Updated last month
- ☆43Updated 2 weeks ago
- [ECCV 2024] Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators☆42Updated 4 months ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆31Updated 7 months ago
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆80Updated 3 months ago
- PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models☆22Updated last month
- Official repository of Uni-AdaFocus (TPAMI 2024).☆31Updated last month
- CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆75Updated last month
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)☆91Updated last year
- [IEEE TPAMI] Latency-aware Unified Dynamic Networks for Efficient Image Recognition☆45Updated 8 months ago
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an…☆72Updated 3 weeks ago
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆57Updated 2 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆81Updated last year
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆32Updated 7 months ago
- ☆36Updated 2 years ago
- This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality"☆43Updated this week
- Jittor implementation of Vision Transformer with Deformable Attention☆30Updated 2 years ago
- ☆20Updated 6 months ago
- Official repository of paper "Subobject-level Image Tokenization"☆64Updated 8 months ago
- The collection of awesome papers on alignment of diffusion models.☆72Updated last month
- The official implementation of "[MASK] is All You Need"☆104Updated last month