bytedance / CascadeV
DiT for VAE (and Video Generation)
β18Updated 2 months ago
Related projects β
Alternatives and complementary repositories for CascadeV
- InstantUnify: Integrates Multimodal LLM into Diffusion Models π₯β36Updated 3 months ago
- Official Repo for Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generationβ25Updated 7 months ago
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformersβ34Updated 3 weeks ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Methodβ26Updated 6 months ago
- β35Updated 7 months ago
- [ICLR 2024] Code for FreeNoise based on AnimateDiffβ105Updated 9 months ago
- β64Updated 5 months ago
- Official code for AnyMaker: Zero-shot General Object Customization via Decoupled Dual-Level ID Injectionβ37Updated 4 months ago
- β103Updated 8 months ago
- Fine-Grained Subject-Specific Attribute Expression Control in T2I Modelsβ108Updated 4 months ago
- [ArXiv 2024] Follow-Your-Canvas: This repo is the official implementation of "Follow-Your-Canvas: Higher-Resolution Video Outpainting witβ¦β92Updated 3 weeks ago
- Code of RealisHuman: A Two-Stage Approach for Refining Malformed Human Parts in Generated Imagesβ53Updated this week
- This respository contains the code for the NeurIPS 2024 paper SF-V: Single Forward Video Generation Model.β84Updated 3 weeks ago
- β42Updated 6 months ago
- [WACV 2025] MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuningβ66Updated 2 weeks ago
- Pytorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation"(CVPR 2024)β91Updated 3 months ago
- [CVPR2024] CapHuman: Capture Your Moments in Parallel Universesβ91Updated 4 months ago
- β23Updated 6 months ago
- More suitable IP-Adapter for the DiT architectureβ26Updated 4 months ago
- β19Updated last month
- FasterCache: Training-Free Video Diffusion Model Acceleration with High Qualityβ136Updated this week
- β21Updated 2 months ago
- Unofficial implementation of Layer Diffuse in diffusersβ25Updated 7 months ago
- Official implementation of Image Conductor: Precision Control for Interactive Video Synthesisβ78Updated 3 months ago
- [CVPR2024] Official code for Drag Your Noise: Interactive Point-based Editing via Diffusion Semantic Propagationβ81Updated 6 months ago
- [ECCV 2024] HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidanceβ39Updated last month
- [ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedbackβ34Updated this week
- [ACM MM24] Official implementation of ACM MM 2024 paper: "ZePo: Zero-Shot Portrait Stylization with Faster Sampling"β34Updated 2 months ago
- HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editingβ74Updated 6 months ago