bytedance / CascadeV
DiT for VAE (and Video Generation)
☆19Updated 2 months ago
Related projects
Alternatives and complementary repositories for CascadeV
- ☆37Updated last week
- Official Repo for Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation☆26Updated 7 months ago
- Official code for CustAny: Customizing Anything from A Single Example☆38Updated this week
- [ArXiv 2024] Follow-Your-Canvas: This repo is the official implementation of "Follow-Your-Canvas: Higher-Resolution Video Outpainting wit…☆95Updated last month
- HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing☆75Updated 7 months ago
- Fine-Grained Subject-Specific Attribute Expression Control in T2I Models☆108Updated 5 months ago
- Official repo: SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing☆50Updated 7 months ago
- ☆17Updated last month
- InstantUnify: Integrates Multimodal LLM into Diffusion Models 🔥☆38Updated 3 months ago
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision"☆32Updated last week
- ☆35Updated 7 months ago
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers☆34Updated last month
- ☆19Updated 2 months ago
- This repository contains the code for the NeurIPS 2024 paper SF-V: Single Forward Video Generation Model.☆84Updated last month
- ☆65Updated 5 months ago
- ☆31Updated 7 months ago
- [NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching☆137Updated last week
- [WACV 2025] MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning☆66Updated 3 weeks ago
- Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation☆38Updated last year
- [NeurIPS 2024 Spotlight] The official implementation of the research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"☆111Updated last month
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆84Updated 4 months ago
- Official code base for paper EZIGen: Enhancing zero-shot subject-driven image generation with precise subject encoding and decoupled guid…☆35Updated last month
- Unofficial implementation of Layer Diffuse in diffusers☆25Updated 7 months ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆78Updated 7 months ago
- ☆73Updated 10 months ago
- Implementation of InstructEdit☆68Updated last year
- [ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user-provided control signals. One that supports free-form user input of control…☆111Updated 4 months ago
- [ICLR 2024] Code for FreeNoise based on AnimateDiff☆106Updated 10 months ago
- ☆21Updated 3 months ago