bytedance / CascadeV
DiT for VAE (and Video Generation)
☆33 · Updated 7 months ago
Alternatives and similar repositories for CascadeV:
Users who are interested in CascadeV are comparing it to the repositories listed below.
- Blending Custom Photos with Video Diffusion Transformers ☆47 · Updated 2 months ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis ☆30 · Updated 3 weeks ago
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think! ☆99 · Updated last month
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening ☆53 · Updated last month
- This is the project for 'Any2Caption', Interpreting Any Condition to Caption for Controllable Video Generation ☆26 · Updated last week
- Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (arXiv, 2024) ☆50 · Updated 4 months ago
- FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation ☆49 · Updated last month
- Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance ☆26 · Updated last week
- [NeurIPS 2024 Spotlight] The official implementation of the research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation" ☆130 · Updated 6 months ago
- [CVPR 2024] The official implementation of the paper "Relation Rectification in Diffusion Model" ☆47 · Updated 7 months ago
- Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation ☆38 · Updated last year
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing ☆52 · Updated last month
- An official implementation of SwapAnyone. ☆59 · Updated last month
- T2VScore: Towards A Better Metric for Text-to-Video Generation ☆80 · Updated last year
- [WACV 2025] MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning ☆82 · Updated 4 months ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models. ☆47 · Updated 6 months ago
- Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis" ☆54 · Updated last week
- [ICLR 2024] Code for FreeNoise based on LaVie ☆35 · Updated last year
- [AAAI 2025] Follow-Your-Canvas: This repo is the official implementation of "Follow-Your-Canvas: Higher-Resolution Video Outpainting with… ☆120 · Updated 6 months ago
- Implementation of SmoothCache, a project aimed at speeding-up Diffusion Transformer (DiT) based GenAI models with error-guided caching. ☆42 · Updated 3 weeks ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method ☆26 · Updated 11 months ago
- [NeurIPS 2024 D&B Track] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models" ☆68 · Updated 3 months ago
- [AAAI 2025] CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities ☆44 · Updated 3 months ago
- [ARXIV'24] StyleMaster: Stylize Your Video with Artistic Generation and Translation ☆100 · Updated 2 weeks ago