jiawen-zhu / TrackGPT
Tracking with Human-Intent Reasoning
☆70 Updated 4 months ago
Alternatives and similar repositories for TrackGPT:
Users who are interested in TrackGPT are comparing it to the repositories listed below.
- Referring Video Object Segmentation / Multi-Object Tracking Repo ☆87 Updated last year
- Large-Vocabulary Video Instance Segmentation dataset ☆82 Updated 8 months ago
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023] ☆28 Updated last year
- A list of referring video object segmentation papers ☆30 Updated 3 weeks ago
- Fast and general video object segmentation evaluation. ☆29 Updated last year
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition" ☆74 Updated 2 months ago
- [ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation ☆52 Updated last year
- ☆40 Updated 5 months ago
- Video Reasoning Segmentation ☆20 Updated 3 months ago
- Awesome video instance segmentation papers ☆37 Updated 2 weeks ago
- [CVPR 2024] Context-Guided Spatio-Temporal Video Grounding ☆51 Updated 9 months ago
- 「AAAI 2024」 Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation ☆77 Updated 9 months ago
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation ☆47 Updated 8 months ago
- UniMD: Towards Unifying Moment retrieval and temporal action Detection ☆43 Updated 8 months ago
- ICCV'2023 | CTVIS: Consistent Training for Online Video Instance Segmentation ☆75 Updated last year
- This repository is an official implementation of the paper A Simple Baseline for Open-World Tracking via Self-training. ☆10 Updated last year
- Video Feature Enhancement with PyTorch ☆28 Updated 4 months ago
- ☆75 Updated last year
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model ☆13 Updated 8 months ago
- [ECCV 2024] Elysium: Exploring Object-level Perception in Videos via MLLM ☆70 Updated 5 months ago
- [NeurIPS 2024] VastTrack: Vast Category Visual Object Tracking ☆63 Updated 5 months ago
- ☆50 Updated 9 months ago
- [TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation ☆48 Updated last year
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning ☆41 Updated last year
- Improving Mamba performance on Video Understanding task ☆38 Updated 5 months ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory" ☆67 Updated 5 months ago
- [NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation ☆31 Updated last year
- This work is accepted by CVPR2023 ☆36 Updated last year
- CVPR 2023 Accepted Paper HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models ☆63 Updated last year
- The benchmark for "Video Object Segmentation in Panoptic Wild Scenes". ☆12 Updated last year