[CVPR 2025 Highlight] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"
☆367Sep 25, 2025Updated 5 months ago
Alternatives and similar repositories for SAMWISE
Users who are interested in SAMWISE are comparing it to the libraries listed below.
- [NeurIPS 2025 Spotlight] "SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation."☆185Dec 17, 2025Updated 2 months ago
- Official implementation of https://arxiv.org/abs/2106.03496☆15Jul 27, 2022Updated 3 years ago
- [CVPR 2025] "A Distractor-Aware Memory for Visual Object Tracking with SAM2"☆459Oct 23, 2025Updated 4 months ago
- Official Repository for "Communication Efficient Federated Learning with Generalized Heavy-Ball Momentum", accepted at TMLR 2025☆27Jul 14, 2025Updated 7 months ago
- [CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).☆535Oct 27, 2025Updated 4 months ago
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024☆43Dec 7, 2024Updated last year
- Interface to stable-baselines3 APIs for training RL policies on gym-registered environments☆12Jan 24, 2024Updated 2 years ago
- Official implementation of "A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives", accepted at CVPR 2…☆24Jun 13, 2024Updated last year
- DROPO: Sim-to-Real Transfer with Offline Domain Randomization☆25Jul 8, 2025Updated 7 months ago
- List of papers written by the Focoos AI research team!☆12Jun 3, 2025Updated 9 months ago
- ☆19May 20, 2022Updated 3 years ago
- Official Repo For Pixel-LLM Codebase☆1,543Jan 23, 2026Updated last month
- ☆13Jul 22, 2025Updated 7 months ago
- Code for the paper "A Sea of Words: An In-Depth Analysis of Anchors for Text Data", AISTATS 2023☆15Oct 26, 2024Updated last year
- (ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations☆129Nov 14, 2025Updated 3 months ago
- Official PyTorch implementation of "Speeding up Heterogeneous Federated Learning with Sequentially Trained Superclients", accepted at ICP…☆17Mar 29, 2022Updated 3 years ago
- Official code for CAVIS: Context-Aware Video Instance Segmentation☆97Sep 17, 2025Updated 5 months ago
- Code for the paper "Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation", ECCV 2024☆47Sep 28, 2024Updated last year
- [CVPR 2024] PEM: Prototype-based Efficient MaskFormer for Image Segmentation☆130Mar 10, 2025Updated 11 months ago
- ☆31Oct 27, 2022Updated 3 years ago
- [CVPR2025] Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".☆180Dec 13, 2024Updated last year
- [CVPR-2024] Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation☆85Jul 24, 2024Updated last year
- [CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"☆873Jan 27, 2026Updated last month
- Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"☆497Mar 17, 2025Updated 11 months ago
- [ICCVW 2025] Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation☆80Oct 22, 2025Updated 4 months ago
- Official Implementation of "Open-Vocabulary Audio-Visual Semantic Segmentation" [ACM MM 2024 Oral].☆35Nov 2, 2024Updated last year
- High Quality Video Reasoning Segmentation☆146Nov 24, 2025Updated 3 months ago
- New Modeling The Background CodeBase☆15Jan 7, 2022Updated 4 years ago
- Collection of gym environments with support for domain randomization☆10Dec 11, 2024Updated last year
- Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything☆1,364May 1, 2025Updated 10 months ago
- A list of referring video object segmentation papers☆57Jun 6, 2025Updated 8 months ago
- Efficient Track Anything☆780Jan 6, 2025Updated last year
- [ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation☆20Sep 5, 2025Updated 5 months ago
- MaskPlanner is a deep learning model for the quick generation of multiple, long-horizon paths from free-form 3D objects represented as po…☆21Jun 20, 2025Updated 8 months ago
- [ECCV 2024] Official Implementation of "Appearance-Based Refinement for Object-Centric Motion Segmentation" Junyu Xie, Weidi Xie, Andrew …☆13Oct 23, 2024Updated last year
- The repo for "Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator"☆682Apr 21, 2025Updated 10 months ago
- Official repository for "AM-RADIO: Reduce All Domains Into One"☆1,665Feb 11, 2026Updated 2 weeks ago
- Pruned CoTracker architecture for tracking the myocardium in 2D echo images.☆19May 6, 2025Updated 9 months ago
- Visual Relationship Reasoning for Grasp Planning☆18May 22, 2025Updated 9 months ago