thunlp / ACDiTLinks
ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer
☆33Updated 5 months ago
Alternatives and similar repositories for ACDiT
Users that are interested in ACDiT are comparing it to the libraries listed below
Sorting:
- HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation☆62Updated 3 months ago
- TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/TokenBridge☆116Updated 3 weeks ago
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)☆73Updated last month
- ☆31Updated last week
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆69Updated 7 months ago
- The official implementation of OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows☆64Updated this week
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆62Updated 2 weeks ago
- ☆163Updated 5 months ago
- [Preprint] UCGM: Unified Continuous Generative Models☆133Updated last week
- ☆55Updated 2 months ago
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆98Updated last month
- [ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.☆111Updated 3 months ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)☆68Updated 3 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]☆81Updated 6 months ago
- Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing☆55Updated this week
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆69Updated 3 months ago
- "SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow", Yuanzhi Zhu, Xingchao Liu, Qiang Liu☆51Updated 6 months ago
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis☆96Updated 2 weeks ago
- Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"☆155Updated last month
- FQGAN: Factorized Visual Tokenization and Generation☆50Updated 2 months ago
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆110Updated 3 months ago
- Official Implementation for Diffusion Models Without Classifier-free Guidance☆125Updated 3 months ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"☆69Updated 8 months ago
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project☆158Updated 2 months ago
- official code for Diff-Instruct algorithm for one-step diffusion distillation☆76Updated 4 months ago
- Boosting Generative Image Modeling via Joint Image-Feature Synthesis☆34Updated last month
- ☆124Updated 11 months ago
- ☆47Updated 2 months ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆79Updated last year
- The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"☆50Updated 2 months ago