maxin-cn / LatteLinks
The official implementation of Latte: Latent Diffusion Transformer for Video Generation.
☆33Updated 4 months ago
Alternatives and similar repositories for Latte
Users that are interested in Latte are comparing it to the libraries listed below
Sorting:
- Adaptive Caching for Faster Video Generation with Diffusion Transformers☆152Updated 8 months ago
- An Efficient Text-to-Image Generation Pretrain Pipeline☆110Updated 2 months ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆143Updated 4 months ago
- ☆97Updated 7 months ago
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"☆302Updated 6 months ago
- Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆207Updated 3 months ago
- [NeurIPS 2024] VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models☆156Updated 9 months ago
- ☆173Updated last year
- [ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation☆192Updated 4 months ago
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion …☆160Updated last year
- VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation☆278Updated 3 months ago
- [NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models☆117Updated 8 months ago
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing☆103Updated last year
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆116Updated 4 months ago
- Code repository for T2V-Turbo and T2V-Turbo-v2☆302Updated 5 months ago
- [NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation☆222Updated 8 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆85Updated last year
- ☆105Updated last year
- Official repo: SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing☆52Updated last year
- [IJCV 2025] Paragraph-to-Image Generation with Information-Enriched Diffusion Model☆105Updated 3 months ago
- MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance (ACM MM2024)☆131Updated 3 months ago
- ☆176Updated last year
- Official implementation of FouriScale (ECCV2024)☆154Updated 11 months ago
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆277Updated 7 months ago
- ACM MM'23 (oral), SUR-adapter for pre-trained diffusion models can acquire the powerful semantic understanding and reasoning capabilities…☆120Updated last year
- [NeurIPS 2024] Official implementation of "Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models"☆339Updated 4 months ago
- [NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching☆159Updated 7 months ago
- The HD-VG-130M Dataset☆118Updated last year
- ☆111Updated last month
- ☆200Updated 5 months ago