longvideoagent / LongVideoAgentLinks
☆84Updated last month
Alternatives and similar repositories for LongVideoAgent
Users that are interested in LongVideoAgent are comparing it to the libraries listed below
Sorting:
- Official Repository of paper: "MotionEdit: Benchmarking and Learning Motion-Centric Image Editing"☆51Updated last week
- Official code of "UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models" WACV2026☆36Updated 2 months ago
- ☆46Updated 3 weeks ago
- [ICCV'25] FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model☆82Updated 6 months ago
- [ICCV 2025] TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation☆38Updated last year
- ☆92Updated 5 months ago
- ☆132Updated 7 months ago
- An official implementation of SwapAnyone.☆74Updated 10 months ago
- 🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"☆165Updated 6 months ago
- (Siggraph Asia 2023) Project Page of "HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image"☆10Updated 2 years ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆50Updated 11 months ago
- ☆58Updated 3 months ago
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆73Updated last year
- VideoAuteur: Towards Long Narrative Video Generation☆43Updated 3 months ago
- Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".☆123Updated last month
- ☆133Updated 10 months ago
- This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation''☆37Updated last month
- Controllable Animation Video Generation with Large Models-based Multimodal Agents☆227Updated 3 weeks ago
- [AAAI 2026] We present LAMIC, a Layout-Aware Multi-Image Composition framework, that extends single-reference diffusion models to multi-r…☆31Updated 5 months ago
- ☆52Updated last year
- [IJCV 2026] HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts☆26Updated 11 months ago
- [3DV 2025] Learning Naturally Aggregated Appearance for Efficient 3D Editing☆33Updated 11 months ago
- Official code for VINCIE: Unlocking In-context Image Editing from Video☆48Updated 4 months ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"☆72Updated 3 months ago
- ObjCtrl-2.5D☆58Updated 10 months ago
- Official PyTorch Implementation for Dual-Process Image Generation, ICCV 2025☆122Updated 5 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Updated last year
- ☆95Updated 10 months ago
- Finetuning and inference tools for the CogView4 and CogVideoX model series.☆114Updated 8 months ago
- Awesome-DragGAN: A curated list of papers, tutorials, repositories related to DragGAN☆83Updated 2 years ago