oooolga / Ctrl-V
πPytorch implementation of "Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion"
β25Updated 3 months ago
Alternatives and similar repositories for Ctrl-V:
Users that are interested in Ctrl-V are comparing it to the libraries listed below
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformersβ35Updated 2 months ago
- β38Updated last year
- Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decodingβ28Updated 3 months ago
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"β44Updated 2 months ago
- [arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generationβ21Updated 4 months ago
- ArXiv paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151β61Updated 4 months ago
- β21Updated last year
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Modelsβ36Updated 4 months ago
- β34Updated 4 months ago
- [Neurips 2024] Video Diffusion Models are Training-free Motion Interpreter and Controllerβ33Updated this week
- Directed Diffusion: Direct Control of Object Placement through Attention Guidance (AAAI2024)β77Updated 11 months ago
- β43Updated 5 months ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physicsβ77Updated last week
- β47Updated 2 months ago
- code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models"β40Updated 5 months ago
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generationβ99Updated 6 months ago
- Official implementation of Auroraβ82Updated last year
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.β46Updated 4 months ago
- Official implementation of "Divide & Bind Your Attention for Improved Generative Semantic Nursing" (BMVC 2023 Oral)β35Updated last year
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Methodβ26Updated 9 months ago
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).β49Updated this week
- β20Updated 7 months ago
- β25Updated 7 months ago
- FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntaxβ18Updated last year
- [ICLR 2024] Code for FreeNoise based on LaVieβ34Updated last year
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"β66Updated last month
- Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversionβ35Updated 6 months ago
- [NeurIPS 2024] Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillationβ58Updated 3 months ago