licksylick / AutoTrackAnythingLinks
AutoTrackAnything is a universal, flexible and interactive tool for insane automatic object tracking over thousands of frames. It is developed upon XMem, Yolov8 and MobileSAM (Segment Anything), can track anything which detect Yolov8.
β92Updated last year
Alternatives and similar repositories for AutoTrackAnything
Users that are interested in AutoTrackAnything are comparing it to the libraries listed below
Sorting:
- Combining "segment-anything" with MOT, it create the era of "MOTS"β156Updated 2 years ago
- π [ICLR 2025] OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformerβ87Updated 6 months ago
- Official Code for "MITracker: Multi-View Integration for Visual Object Tracking"β124Updated 7 months ago
- [AAAI 2026] Code for "SAM2MOT: A Novel Paradigm of Multi-Object Tracking by Segmentation".β159Updated 2 months ago
- OVTrack: Open-Vocabulary Multiple Object Tracking [CVPR 2023]β112Updated last year
- Official code for NetTrack [CVPR 2024]β111Updated last year
- [ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anythingβ268Updated 9 months ago
- [CVPR 2025] "A Distractor-Aware Memory for Visual Object Tracking with SAM2"β457Updated 3 months ago
- [ECCV24] Keypoint Promptable Re-Identification: SOTA ReID method robust to occlusions and multi-person ambiguityβ199Updated 7 months ago
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmarkβ177Updated 3 months ago
- YOLO-World + EfficientViT SAMβ106Updated last year
- Includes the VideoCount dataset and CountVid code for the paper Open-World Object Counting in Videos.β86Updated last month
- Official Code for Tracking Any Object Amodallyβ120Updated last year
- DVIS: Decoupled Video Instance Segmentation Frameworkβ158Updated last year
- [ICCV2023] MixSort: The Customized Tracker in SportsMOTβ90Updated 2 years ago
- Focusing on Tracks for Online Multi-Object Trackingβ91Updated 4 months ago
- Official implementation of π« CAMELTrack: Context-Aware Multi-cue ExpLoitation for Online Multi-Object Tracking π«β112Updated 2 weeks ago
- A Graph-Based Approach for Category-Agnostic Pose Estimation [ECCV 2024]β384Updated last year
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.β134Updated last year
- Codebase for the Recognize Anything Model (RAM)β88Updated 2 years ago
- Official Pytorch Implementation for βDINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Videoβ (ECCV 2024)β549Updated last year
- β134Updated last year
- [ICCV 2023] ReST: A Reconfigurable Spatial-Temporal Graph Model for Multi-Camera Multi-Object Trackingβ165Updated last year
- [ECCV 2024 & NeurIPS 2024 & ICLR 2026] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3β270Updated last week
- Includes the code for training and testing the CountGD model from the paper CountGD: Multi-Modal Open-World Counting.β301Updated 7 months ago
- β83Updated last month
- Official code for CAVIS: Context-Aware Video Instance Segmentationβ95Updated 4 months ago
- yolov8 model with SAM metaβ143Updated 2 years ago
- Implementation of Tracking Every Thing in the Wild, ECCV 2022β96Updated last year
- DETRPose: Real-time end-to-end transformer model for multi-person pose estimationβ68Updated last month