Luo-Yihong / TDM
[Few-Step Student Surpasses Teacher Diffusion] Learning Few-Step Diffusion Models by Trajectory Distribution Matching
☆35 Updated last month
Alternatives and similar repositories for TDM:
Users who are interested in TDM are comparing it to the repositories listed below
- The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation" ☆48 Updated 3 weeks ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024] ☆81 Updated 5 months ago
- Improving Video Generation with Human Feedback ☆164 Updated 3 weeks ago
- ☆61 Updated 4 months ago
- Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Vi… ☆171 Updated last month
- Official Implementation of VideoDPO ☆96 Updated this week
- Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers ☆171 Updated last week
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍 ☆39 Updated 2 weeks ago
- Adaptive Caching for Faster Video Generation with Diffusion Transformers ☆145 Updated 5 months ago
- 【CVPR 2025 Oral】Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea" ☆109 Updated 3 weeks ago
- ☆92 Updated 3 weeks ago
- UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer ☆76 Updated last month
- An Efficient Text-to-Image Generation Pretrain Pipeline ☆103 Updated last week
- [ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation ☆284 Updated 2 months ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations ☆140 Updated 2 months ago
- Subjects200K dataset ☆107 Updated 3 months ago
- Finetuning and inference tools for the CogView4 and CogVideoX model series. ☆45 Updated this week
- [Arxiv'25] BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing ☆84 Updated last month
- Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation" ☆134 Updated this week
- CAR: Controllable AutoRegressive Modeling for Visual Generation ☆115 Updated 4 months ago
- ☆84 Updated 5 months ago
- Official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024] ☆88 Updated 2 months ago
- ☆40 Updated 3 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing ☆55 Updated last month
- A collection of diffusion models based on FLUX/DiT for image/video generation, editing, reconstruction, inpainting, etc. ☆40 Updated this week
- [CVPR 2025 Oral] Alias-free Latent Diffusion Models (official implementation) ☆74 Updated last month
- EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling. ☆97 Updated 2 months ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis ☆36 Updated last month
- Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up". ☆204 Updated 2 weeks ago
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers ☆115 Updated 3 months ago