TRI-ML / VOST
Code for the VOST dataset
☆23 Updated last year
Alternatives and similar repositories for VOST:
Users who are interested in VOST are comparing it to the repositories listed below
- ☆9 Updated last year
- Large-Vocabulary Video Instance Segmentation dataset ☆77 Updated 6 months ago
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation ☆76 Updated 7 months ago
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model' ☆32 Updated last year
- [ICLR 2024 Spotlight] Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments ☆19 Updated 2 months ago
- Code accompanying Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos (CVPR 2021) ☆33 Updated 3 years ago
- [ECCV 2024 Oral] ActionVOS: Actions as Prompts for Video Object Segmentation ☆31 Updated last month
- [ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding ☆20 Updated last year
- Code for the paper "Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundatio… ☆27 Updated last year
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation ☆46 Updated 6 months ago
- CVPR 2024 "Instance Tracking in 3D Scenes from Egocentric Videos" ☆18 Updated 7 months ago
- [CVPR 2024] Data and benchmark code for the EgoExoLearn dataset ☆51 Updated 4 months ago
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023] ☆27 Updated 10 months ago
- [ECCV2022] Global Spectral Filter Memory Network for Video Object Segmentation ☆39 Updated 2 years ago
- [TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation ☆48 Updated last year
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation ☆33 Updated last month
- ☆47 Updated 2 years ago
- ☆12 Updated last year
- [ECCV 2024] OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models ☆37 Updated 3 weeks ago
- Code for ECCV2022 Paper "Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection" ☆36 Updated last year
- ☆57 Updated last year
- ☆34 Updated 10 months ago
- The official code for Relational Context Learning for Human-Object Interaction Detection, CVPR2023. ☆48 Updated last year
- RefVOS ☆29 Updated 3 years ago
- 👾 E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding (NeurIPS 2024) ☆50 Updated last week
- ☆8 Updated last year
- Accepted by CVPR 2022 ☆36 Updated 2 years ago
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection ☆35 Updated last year
- Official implementation of the NeurIPS 2023 paper "Self-supervised Object-Centric Learning for Videos" ☆26 Updated 2 months ago
- Series of work (ECCV2020, CVPR2021, CVPR2021, ECCV2022) about Compositional Learning for Human-Object Interaction Exploration ☆81 Updated last year