Uncertainty-aware Fine-tuning of Segmentation Foundation Models (NeurIPS 2024).
☆14Jan 9, 2025Updated last year
Alternatives and similar repositories for SUM
Users that are interested in SUM are comparing it to the libraries listed below
Sorting:
- CVPR2022:Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency☆18Aug 10, 2022Updated 3 years ago
- Learning Better Video Query with SAM for Video Instance Segmentation (TCSVT 2024)☆26Apr 2, 2024Updated last year
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆27Feb 28, 2026Updated last week
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆19Jul 10, 2025Updated 7 months ago
- Weakly Supervised Referring Video Object Segmentation with Object-Centric Pseudo-Guidance☆10Aug 17, 2024Updated last year
- The official repository of "MarineInst: A Foundation Model for Marine Image Analysis with Instance Visual Description". [ECCV Oral 2024.]☆17Sep 24, 2024Updated last year
- ☆13Aug 27, 2020Updated 5 years ago
- ☆12Aug 19, 2023Updated 2 years ago
- Supervision by Fusion: Towards Unsupervised Learning of Deep Salient Object Detector☆11Jun 24, 2023Updated 2 years ago
- [ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.☆11Jul 16, 2024Updated last year
- [CVPR 2025] Code for "Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question Answering".☆20Jun 16, 2025Updated 8 months ago
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval☆21Jun 23, 2025Updated 8 months ago
- Pytorch implementation of 'Improving Self-supervised Lightweight Model Learning via Hard-aware Metric Distillation. In ECCV 2022'☆12Mar 22, 2023Updated 2 years ago
- Recording of Kinect V2 Streams at 30 fps.☆10Jul 5, 2017Updated 8 years ago
- DAVIS web repo☆10Jan 26, 2023Updated 3 years ago
- This is a repo for paper of "Sam-Based Instance Segmentation Models for the Automation of Structural Damage Detection"☆13Feb 22, 2025Updated last year
- ☆15Dec 2, 2025Updated 3 months ago
- ☆18Jul 3, 2025Updated 8 months ago
- INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and Helpfulness☆14Nov 10, 2025Updated 3 months ago
- An implementation of Real-Time Salient Object Detection with a Minimum Spanning Tree, CVPR 2016☆12Jul 15, 2019Updated 6 years ago
- [ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervision☆12Sep 17, 2023Updated 2 years ago
- [ECCV 2024] Official PyTorch implementation of "Classification Matters: Improving Video Action Detection with Class-Specific Attention"☆17Nov 8, 2024Updated last year
- LLaVA-Next for STVG☆18Dec 5, 2025Updated 3 months ago
- Extensive study and research on Udacity Self-driving Car Challenge 2☆10Dec 11, 2021Updated 4 years ago
- Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation[TNNLS2024]☆13May 6, 2025Updated 10 months ago
- Original code base for On Pretraining Data Diversity for Self-Supervised Learning☆14Dec 30, 2024Updated last year
- Experiments with self-supervised learning☆11Mar 9, 2020Updated 5 years ago
- Implementation of 'Attention-guided Feature Fusion for Small Object Detection'☆14Dec 21, 2023Updated 2 years ago
- Repo for paper "MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding".☆39Jun 9, 2025Updated 8 months ago
- Data & Code for FEDD published @ MICCAI 23☆12Oct 11, 2023Updated 2 years ago
- ☆12May 26, 2023Updated 2 years ago
- "Visual Prompt Selection for In-Context Learning Segmentation Framework"☆15Dec 13, 2024Updated last year
- ☆12Jan 10, 2025Updated last year
- ☆13Nov 20, 2021Updated 4 years ago
- [ICLR'23] Effective Self-supervised Pre-training on Low-compute networks without Distillation☆18Oct 9, 2024Updated last year
- SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability☆16May 8, 2025Updated 9 months ago
- Official implementation of Transformer-based Efficient Salient Instance Segmentation Networks with Orientative Query☆12Sep 15, 2022Updated 3 years ago
- ☆13May 23, 2022Updated 3 years ago
- This is a PyTorch implementation of "Cross-modality Discrepant Interaction Network for RGB-D Salient Object Detection" accepted by ACM MM…☆11Nov 22, 2021Updated 4 years ago