maxin-cn / Latte
The official implementation of Latte: Latent Diffusion Transformer for Video Generation.
☆33 Updated 4 months ago
Alternatives and similar repositories for Latte
Users who are interested in Latte are comparing it to the repositories listed below.
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations ☆142 Updated 4 months ago
- ☆90 Updated 6 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆85 Updated 11 months ago
- Adaptive Caching for Faster Video Generation with Diffusion Transformers ☆150 Updated 7 months ago
- Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation ☆38 Updated last year
- ☆104 Updated 11 months ago
- [IJCV 2025] Paragraph-to-Image Generation with Information-Enriched Diffusion Model ☆104 Updated 3 months ago
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image" ☆302 Updated 6 months ago
- ☆64 Updated 10 months ago
- Official PyTorch implementation for ICLR2024 paper "The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing" ☆111 Updated last year
- Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up". ☆207 Updated 2 months ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models" ☆99 Updated 11 months ago
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers ☆117 Updated 5 months ago
- ☆35 Updated 4 months ago
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion … ☆158 Updated last year
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing ☆102 Updated last year
- ☆105 Updated last week
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching ☆105 Updated 11 months ago
- [ICLR 2025] Official PyTorch implementation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stit… ☆103 Updated last year
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO). ☆78 Updated last year
- An Efficient Text-to-Image Generation Pretrain Pipeline ☆109 Updated 2 months ago
- MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance (ACM MM2024) ☆131 Updated 2 months ago
- [ICLR 2025] You Only Sample Once: Taming One-Step Text-To-Image Synthesis by Self-Cooperative Diffusion GANs ☆61 Updated 3 months ago
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think! ☆113 Updated 3 months ago
- TerDiT: Ternary Diffusion Models with Transformers ☆71 Updated last year
- [NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching ☆158 Updated 7 months ago
- A light-weight and high-efficient training framework for accelerating diffusion tasks. ☆48 Updated 9 months ago
- [NeurIPS 2024] VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models ☆155 Updated 9 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024] ☆91 Updated 4 months ago
- The HD-VG-130M Dataset ☆118 Updated last year