LeapLabTHU / AdaNAT
[ECCV 2024] AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation
☆33Updated 5 months ago
Alternatives and similar repositories for AdaNAT:
Users that are interested in AdaNAT are comparing it to the libraries listed below
- [NeurIPS 2024] ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis☆22Updated 3 months ago
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"☆42Updated 8 months ago
- [ECCV 2024] Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators☆43Updated 5 months ago
- Official repository of Uni-AdaFocus (TPAMI 2024).☆41Updated 2 months ago
- Official implementation of Dynamic Perceiver☆42Updated last year
- ☆29Updated 2 months ago
- Liquid: Language Models are Scalable and Unified Multi-modal Generators☆67Updated this week
- [NeurIPS 2024] Official code for HourVideo: 1-Hour Video Language Understanding☆66Updated last month
- [ICML 2024] SimPro: A Simple Probabilistic Framework Towards Realistic Long-Tailed Semi-Supervised Learning☆28Updated 5 months ago
- ☆24Updated 4 months ago
- ☆17Updated last month
- This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality"☆45Updated last month
- [NeurIPS 2022] Latency-aware Spatial-wise Dynamic Networks☆24Updated last year
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆84Updated 4 months ago
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)☆90Updated last year
- ☆16Updated 4 months ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆32Updated 8 months ago
- [IEEE TPAMI] Latency-aware Unified Dynamic Networks for Efficient Image Recognition☆47Updated 10 months ago
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆26Updated 2 weeks ago
- Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models☆66Updated 9 months ago
- Official repository of InLine attention (NeurIPS 2024)☆43Updated 2 months ago
- [ECCV 2024] ControlCap: Controllable Region-level Captioning☆70Updated 4 months ago
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆51Updated 2 weeks ago
- VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation☆85Updated 5 months ago