patrick-tssn / Streaming-Grounded-SAM-2
Grounded Tracking for Streaming Videos
☆52 · Updated last month
Related projects
Alternatives and complementary repositories for Streaming-Grounded-SAM-2
- Run Segment Anything Model 2 on a live video stream ☆187 · Updated last month
- ☆145 · Updated 5 months ago
- ☆211 · Updated 4 months ago
- [ICCV 2023 R6D] PyTorch implementation of CNOS: A Strong Baseline for CAD-based Novel Object Segmentation based on Segmenting Anything an… ☆213 · Updated last month
- [ECCV 2024] Official implementation of the paper "TAPTR: Tracking Any Point with Transformers as Detection" ☆202 · Updated 3 months ago
- [ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation ☆179 · Updated 2 weeks ago
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding ☆69 · Updated last month
- [ICRA 2024 Oral] Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation ☆104 · Updated 3 months ago
- ☆28 · Updated 10 months ago
- [CVPR 2024] PyTorch implementation of NOPE: Novel Object Pose Estimation from a Single Image ☆185 · Updated 7 months ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding ☆48 · Updated 3 weeks ago
- Welcome to the project repository for POPE (Promptable Pose Estimation), a state-of-the-art technique for 6-DoF pose estimation of any ob… ☆141 · Updated 4 months ago
- HANDAL Dataset and Pipeline ☆68 · Updated 5 months ago
- GraspSplats: Efficient Manipulation with 3D Feature Splatting ☆70 · Updated this week
- Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024) ☆125 · Updated last week
- [ECCV 2024] Decomposition Betters Tracking Everything Everywhere ☆113 · Updated 4 months ago
- Grounded-SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and … ☆38 · Updated last year
- Muggled SAM: Segmentation without the magic ☆60 · Updated this week
- Official PyTorch implementation of Self-Supervised Any-Point Tracking by Contrastive Random Walks, ECCV 2024. ☆43 · Updated 2 weeks ago
- Official implementation of "Local All-Pair Correspondence for Point Tracking" (ECCV 2024) ☆136 · Updated 3 weeks ago
- ☆52 · Updated 2 months ago
- SceneFun3D ToolKit ☆79 · Updated last month
- This is the official repository for OVIR-3D: Open-Vocabulary 3D Instance Retrieval Without Training on 3D Data. (CoRL'23) ☆95 · Updated last year
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning ☆232 · Updated last week
- NIDS-Net: A unified framework for novel instance detection and segmentation ☆45 · Updated 2 months ago
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model` ☆89 · Updated last month
- Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding" ☆197 · Updated 3 weeks ago
- Official Code For Track Everything Everywhere Fast and Robustly ☆48 · Updated 2 months ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models. ☆95 · Updated 3 months ago
- [ECCV 2024] ShapeLLM: Universal 3D Object Understanding for Embodied Interaction ☆144 · Updated last month