langmanbusi / InsViE
Official PyTorch implementation of the paper “InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction”
☆16 Updated 3 weeks ago
Alternatives and similar repositories for InsViE
Users that are interested in InsViE are comparing it to the libraries listed below
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers ☆55 Updated last month
- ☆33 Updated 7 months ago
- SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing ☆18 Updated 5 months ago
- [CVPR2025] Official PyTorch implementation of "Optical-Flow Guided Prompt Optimization for Coherent Video Generation (Motion Prompt)" ☆22 Updated 2 months ago
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text ☆50 Updated 2 months ago
- [NeurIPS 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller ☆42 Updated last month
- [NeurIPS 2024] COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing ☆24 Updated 5 months ago
- OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation ☆13 Updated 5 months ago
- ☆43 Updated last month
- ☆39 Updated last year
- A list of works on video generation towards world model ☆101 Updated last week
- Implementation of paper EditCLIP: Representation Learning for Image Editing ☆23 Updated 2 months ago
- Sora Generates Videos with Stunning Geometrical Consistency ☆49 Updated last year
- open-sourced video dataset with dynamic scenes and camera movements annotation ☆59 Updated last month
- ☆24 Updated last month
- [ICLR 2025] Trajectory Attention For Fine-grained Video Motion Control ☆79 Updated 3 weeks ago
- VideoDirector [CVPR 2025] ☆19 Updated 2 months ago
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization ☆21 Updated last month
- ☆33 Updated 2 months ago
- Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations" ☆33 Updated 2 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing ☆62 Updated 3 months ago
- This is the official implementation of VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Mode… ☆13 Updated 3 months ago
- Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation ☆117 Updated 2 weeks ago
- Training-Free Text-Guided Image Editing Using Visual Autoregressive Model ☆44 Updated last month
- Official source codes of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025) ☆49 Updated 4 months ago
- Code of the paper "FreePCA:Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Princi… ☆20 Updated last week
- ☆30 Updated last year
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models ☆25 Updated 7 months ago
- [ECCV2024] ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation ☆53 Updated 2 months ago
- [NeurIPS 2024] The official implementation of the research paper "FreeLong : Training-Free Long Video Generation with SpectralBlend Temporal Atten… ☆44 Updated 3 months ago