zrporz / AutoSeg-SAM2
This is an automatic full segmentation tool based on Segment-Anything-2 and Segment-Anything-1. Our tool performs automatic full segmentation of the video, enabling the tracking of each object and the detection of possible new objects.
☆159Updated 5 months ago
Alternatives and similar repositories for AutoSeg-SAM2:
Users that are interested in AutoSeg-SAM2 are comparing it to the libraries listed below
- Pytorch Implementation of "SMITE: Segment Me In TimE" (ICLR 2025)☆206Updated last month
- ☆264Updated 6 months ago
- GenXD: Generating Any 3D and 4D Scenes. ICLR 2025☆176Updated this week
- ☆257Updated 3 months ago
- The official implementation of "GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation". (CVPR 2025)☆205Updated this week
- Depth Any Video with Scalable Synthetic Data (ICLR 2025)☆463Updated 3 months ago
- [ICLR 2025] Official implementation of "DiffSplat: Repurposing Image Diffusion Models for Scalable 3D Gaussian Splat Generation".☆304Updated this week
- [ICLR 2025] VideoGrain: This repo is the official implementation of "VideoGrain: Modulating Space-Time Attention for Multi-Grained Video …☆110Updated last week
- Source code of paper "NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer"☆287Updated last month
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning☆269Updated last month
- [CVPR'24] Interactive3D: Create What You Want by Interactive 3D Generation☆180Updated 6 months ago
- Code for PhysDreamer☆545Updated last month
- A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World☆233Updated 4 months ago
- ☆511Updated 10 months ago
- Aether: Geometric-Aware Unified World Modeling☆120Updated last week
- Code for the paper "pix2gestalt: Amodal Segmentation by Synthesizing Wholes" (CVPR 2024)☆162Updated 10 months ago
- [CVPR 2025] Prompt Depth Anything☆666Updated 3 weeks ago
- Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text" (NeurIPS 2024).☆336Updated 2 weeks ago
- High-quality and editable surfel 3D Gaussian generation through native 3D diffusion (ICLR 2025)☆289Updated last month
- Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation☆237Updated 5 months ago
- Official repository for "Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation" (ICLR2025)☆68Updated last week
- Official code of "Imagine360: Immersive 360 Video Generation from Perspective Anchor"☆113Updated last week
- "Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models", Hanwen Liang*, Yuyang Yin*, Dejia Xu, Hanxue Li…☆283Updated 2 months ago
- We have released official implementation in https://github.com/VAST-AI-Research/MIDI-3D☆129Updated 2 weeks ago
- [WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think☆407Updated 3 months ago
- Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation☆598Updated 11 months ago
- [ECCV 2024 & NeurIPS 2024] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3☆261Updated 3 months ago
- ViewDiff generates high-quality, multi-view consistent images of a real-world 3D object in authentic surroundings. (CVPR2024).☆363Updated this week
- [ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models☆173Updated 2 months ago
- [CVPR 2025] Official code for Using Diffusion Priors for Video Amodal Segmentation☆64Updated this week