LeapLabTHU / ImprovedNAT
A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"
☆34Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for ImprovedNAT
- [ECCV 2024] AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation☆31Updated 2 months ago
- Official implementation of Dynamic Perceiver☆41Updated last year
- This is a repo to track the latest autoregressive visual generation papers.☆50Updated this week
- ☆36Updated last year
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)☆91Updated last year
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook☆42Updated 5 months ago
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆30Updated 5 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆78Updated 10 months ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆28Updated 5 months ago
- [IEEE TPAMI] Latency-aware Unified Dynamic Networks for Efficient Image Recognition☆42Updated 6 months ago
- [ECCV 2024] Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators☆33Updated 2 months ago
- MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation☆25Updated last year
- Codebase for the paper-Elucidating the design space of language models for image generation☆31Updated last week
- 🔥ImageFolder: Autoregressive Image Generation with Folded Tokens☆59Updated last week
- ☆31Updated 3 weeks ago
- Jittor implementation of Vision Transformer with Deformable Attention☆30Updated 2 years ago
- [CVPR 2023] Zero-shot Generative Model Adaptation via Image-specific Prompt Learning☆82Updated last year
- ☆38Updated last month
- Denoising Diffusion Step-aware Models (ICLR2024)☆52Updated 9 months ago
- [NeurIPS 2022] Latency-aware Spatial-wise Dynamic Networks☆24Updated last year
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆64Updated 5 months ago
- ☆52Updated last year
- ☆58Updated last year
- [NeurIPS 2024] The official implement of research paper "FreeLong : Training-Free Long Video Generation with SpectralBlend Temporal Atten…☆29Updated last week
- (NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights☆19Updated 3 weeks ago
- [CVPR 2024 Highlight] ImageNet-D☆38Updated last month
- Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆51Updated 3 months ago
- VisualGPTScore for visio-linguistic reasoning☆26Updated last year
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆41Updated 3 weeks ago