oooolga / Ctrl-V
πPytorch implementation of "Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion"
β26Updated 6 months ago
Alternatives and similar repositories for Ctrl-V:
Users that are interested in Ctrl-V are comparing it to the libraries listed below
- [arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generationβ24Updated 6 months ago
- [Neurips 2024] Video Diffusion Models are Training-free Motion Interpreter and Controllerβ40Updated 3 weeks ago
- Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation"β38Updated last month
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initializationβ18Updated 3 weeks ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpeningβ58Updated 2 months ago
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"β46Updated 5 months ago
- A list of works on video generation towards world modelβ53Updated this week
- β33Updated 6 months ago
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformersβ51Updated last week
- Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcingβ48Updated this week
- ArXiv paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151β63Updated 6 months ago
- β14Updated 2 months ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)β66Updated 2 months ago
- Official implementation of "Divide & Bind Your Attention for Improved Generative Semantic Nursing" (BMVC 2023 Oral)β35Updated last year
- β39Updated last year
- β26Updated last week
- code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models"β41Updated 8 months ago
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Modelsβ37Updated 7 months ago
- β30Updated last month
- The official repository of "Spectral Motion Alignment for Video Motion Transfer using Diffusion Models".β27Updated 4 months ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.β47Updated 6 months ago
- β21Updated last year
- β45Updated last month
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach'β17Updated 6 months ago
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"β68Updated 4 months ago
- [3DV 2025] Learning Naturally Aggregated Appearance for Efficient 3D Editingβ34Updated 2 months ago
- Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversionβ40Updated 9 months ago
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmarkβ16Updated last week
- β22Updated 10 months ago
- VideoAuteur: Towards Long Narrative Video Generationβ36Updated 3 months ago