FudanCVL / AnyI2VLinks
[ICCV 2025] AnyI2V: Animating Any Conditional Image with Motion Control Generation
☆116Updated last month
Alternatives and similar repositories for AnyI2V
Users that are interested in AnyI2V are comparing it to the libraries listed below
Sorting:
- [ICCV 2025] MOVE: Motion-Guided Few-Shot Video Object Segmentation☆42Updated 3 weeks ago
- Code of BRIDGE: Building Reinforcement-Learning Depth-to-Image Data Generation Engine for Monocular Depth Estimation☆53Updated this week
- [ICCV 2025] Free-Form Motion Control: A Synthetic Video Generation Dataset with Controllable Camera and Object Motions☆46Updated last month
- [CVPR-2024] Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation☆86Updated last year
- Multimodal Referring Segmentation☆141Updated 3 weeks ago
- A Survey of Image Editing☆435Updated last month
- [CVPR-2023] Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation☆18Updated 2 years ago
- [CVPR-2023] Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation☆189Updated 2 years ago
- [ICCV 2023] MOSE: A New Dataset for Video Object Segmentation in Complex Scenes☆357Updated last week
- [AAAI 2025] MultiBooth: This repo is the official implementation of "MultiBooth: Towards Generating All Your Concepts in an Image from Te…☆117Updated 9 months ago
- [ACM MM-2024] RefMask3D: Language-Guided Transformer for 3D Referring Segmentation☆65Updated last year
- Hash3D: Training-free Acceleration for 3D Generation☆173Updated 11 months ago
- A benchmark dataset for GRES and GREC [CVPR2023 Highlight]☆237Updated 2 years ago
- [ICCV 2023] MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions☆519Updated last month
- [TIP-2023] Prototype Adaption and Projection for Few- and Zero-shot 3D Point Cloud Semantic Segmentation☆80Updated 2 years ago
- [NeurIPS 2025 spotlight] QFFT, Question-Free Fine-Tuning for Adaptive Reasoning☆91Updated 2 weeks ago
- ☆11Updated last week
- [ICCV2021 & TPAMI2023] Vision-Language Transformer and Query Generation for Referring Segmentation☆358Updated 3 years ago
- This is the official code for the paper Tailor3D☆178Updated last year
- this is a tool and a displayer that allows us to place the 3D model and reshape them.☆14Updated 2 years ago
- [EMNLP 2025] RAG-Instruct: Boosting LLMs with Diverse Retrieval-Augmented Instructions☆131Updated 5 months ago
- Official PyTorch implementation of "ViBiDSampler: Enhancing Video Interpolation Using Bidirectional Diffusion Sampler"☆20Updated 8 months ago
- Official repo of Toucan: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments☆49Updated this week
- [ICCV'25] Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation"☆69Updated 3 months ago
- VideoDirector [CVPR 2025]☆28Updated 6 months ago
- [CVPR-2018] Context Contrasted Feature and Gated Multi-Scale Aggregation for Scene Segmentation☆25Updated 5 years ago
- ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models☆79Updated 3 weeks ago
- 基于SpringBoot的项目管理系统-后端☆127Updated 2 months ago
- [CVPR-2019] Semantic Correlation Promoted Shape-Variant Context for Segmentation☆32Updated 6 years ago
- [CVPR 2024 Highlight] MIGC and [TPAMI 2024] MIGC++ (Official Implementation)☆607Updated 4 months ago