zrporz / AutoSeg-SAM2
This is an automatic full segmentation tool based on Segment-Anything-2 and Segment-Anything-1. Our tool performs automatic full segmentation of the video, enabling the tracking of each object and the detection of possible new objects.
☆140Updated 4 months ago
Alternatives and similar repositories for AutoSeg-SAM2:
Users that are interested in AutoSeg-SAM2 are comparing it to the libraries listed below
- GenXD: Generating Any 3D and 4D Scenes☆175Updated 3 weeks ago
- ☆230Updated last month
- ☆257Updated 4 months ago
- [WACV 2025] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think☆388Updated 2 months ago
- Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation☆585Updated 10 months ago
- [CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis☆333Updated 5 months ago
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning☆262Updated 3 months ago
- Official repository for "Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation"☆65Updated 5 months ago
- "Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models", Hanwen Liang*, Yuyang Yin*, Dejia Xu, Hanxue Li…☆271Updated last month
- Code for the paper "pix2gestalt: Amodal Segmentation by Synthesizing Wholes" (CVPR 2024)☆156Updated 9 months ago
- Code for PhysDreamer☆537Updated last week
- [AAAI 2025🔥] Official implementation of Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle☆195Updated this week
- Official implementation of Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting (ECCV…☆274Updated 6 months ago
- "4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency", Yuyang Yin*, Dejia Xu*, Zhangyang Wang, Yao Zhao, Yunchao Wei☆229Updated 7 months ago
- [CVPR'24] Interactive3D: Create What You Want by Interactive 3D Generation☆178Updated 5 months ago
- [ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models☆152Updated 3 weeks ago
- Source code of paper "NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer"☆270Updated 2 weeks ago
- 3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation☆321Updated last month
- A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World☆217Updated 2 months ago
- DreamScene: 3D Gaussian-based Text-to-3D Scene Generation via Formation Pattern Sampling☆138Updated this week
- ☆136Updated last month
- [ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints☆492Updated 2 months ago
- Official impl. of "MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation"☆118Updated 2 months ago
- Code for "Real3D: Scaling Up Large Reconstruction Models with Real-World Images"☆163Updated 8 months ago
- Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation☆222Updated 3 months ago
- Prompt Depth Anything☆549Updated this week
- official repo of paper for "CamI2V: Camera-Controlled Image-to-Video Diffusion Model"☆97Updated this week
- Official code of "Imagine360: Immersive 360 Video Generation from Perspective Anchor"☆99Updated 3 weeks ago
- [CVPR 2024] SceneWiz3D: Towards Text-guided 3D Scene Composition☆96Updated 9 months ago