thu-ml / CCA
Code accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"
☆20Updated last week
Related projects
Alternatives and complementary repositories for CCA
- ☆31Updated 3 weeks ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆78Updated 7 months ago
- Codebase for the paper "Elucidating the Design Space of Language Models for Image Generation"☆31Updated last week
- Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding☆23Updated 3 weeks ago
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook☆42Updated 5 months ago
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆76Updated last month
- A big_vision-inspired repo that implements a generic Auto-Encoder class capable of representation learning and generative modeling.☆30Updated 4 months ago
- Official PyTorch implementation of the paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stitching"☆96Updated 8 months ago
- Official PyTorch Implementation of "Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models"☆31Updated last month
- 🔥 Aurora Series: A more efficient multimodal large language model series for video.☆47Updated last week
- [NeurIPS 2024] Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation☆51Updated 3 weeks ago
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆41Updated 3 weeks ago
- 🔥ImageFolder: Autoregressive Image Generation with Folded Tokens☆59Updated last week
- This is a repo to track the latest autoregressive visual generation papers.☆50Updated this week
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"☆36Updated last month
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆84Updated 4 months ago
- "SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow", Yuanzhi Zhu, Xingchao Liu, Qiang Liu☆40Updated last month
- Vico: Compositional Video Generation as Flow Equalization☆53Updated last week
- Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"☆75Updated 7 months ago
- [arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generation☆19Updated last month
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching☆75Updated 4 months ago
- Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023☆37Updated last year
- Fixed official code for the paper "A Closer Look at Parameter-Efficient Tuning in Diffusion Models".☆40Updated last year
- Memory Efficient Training Framework for Large Video Generation Model☆24Updated 7 months ago
- An in-context conditioning version of MUSE with pre-trained checkpoints.☆111Updated last year
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NeurIPS 2024]☆49Updated last week
- ☆23Updated 6 months ago
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation☆49Updated 2 months ago
- Denoising Diffusion Step-aware Models (ICLR2024)☆52Updated 9 months ago
- SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training☆162Updated last month