[AAAI 2025] Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking
β120May 18, 2025Updated last year
Alternatives and similar repositories for STTrack
Users that are interested in STTrack are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Explicit Context Reasoning with Supervision for Visual Tracking (ACM MM 25)β18Jul 20, 2025Updated 10 months ago
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption πβ46Jul 5, 2025Updated 11 months ago
- β27Apr 3, 2024Updated 2 years ago
- CLIP-Guided Object Restoration for Defense Against 3D Point Cloud Backdoor Attacksβ20May 11, 2026Updated last month
- [AAAI 2025] Pre-Training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose Estimationβ33Jun 3, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- CoDi:Subject-Consistent and Pose-Diverse Text-to-Image Generationβ37Aug 1, 2025Updated 10 months ago
- The official implementation for the CVPR'2025 paper Dynamic Updates for Language Adaptation in Visual-Language Trackingβ40Mar 27, 2025Updated last year
- CVPR24β68Aug 4, 2024Updated last year
- [CVPR 2025] Official code of "From Zero to Detail: Deconstructing Ultra-High-Definition Image Restoration from Progressive Spectral Perspβ¦β58Apr 16, 2026Updated last month
- Bi-directional Adapter for Multi-modal Trackingβ100Mar 19, 2024Updated 2 years ago
- TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenesβ97Nov 26, 2025Updated 6 months ago
- SGNet: Structure Guided Network via Gradient-Frequency Awareness for Depth Map Super-Resolution (AAAI 2024)β120Apr 5, 2024Updated 2 years ago
- [ICLR 2026] MotionSight's official code implementation.β48Apr 24, 2026Updated last month
- Robust Tracking via Mamba-based Context-aware Token Learning (AAAI 2025)β16Nov 6, 2025Updated 7 months ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [AAAI2025] SUTrack: Towards Simple and Unified Single Object Trackingβ148Jun 16, 2025Updated 11 months ago
- Latest Advances on Autoregressive Visual Models.πβ28Mar 15, 2025Updated last year
- [AAAI 2025] Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Videoβ117Jun 14, 2025Updated 11 months ago
- The official repository of the paper "DCDepth: Progressive Monocular Depth Estimationin in Discrete Cosine Domain" (NeurIPS-2024)β40Mar 10, 2025Updated last year
- β25Dec 20, 2024Updated last year
- The official implementation for the paper [ODTrack: Online Dense Temporal Token Learning for Visual Tracking].β184Oct 7, 2024Updated last year
- Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction (ICRA 2025)β55Dec 7, 2025Updated 6 months ago
- Source code of the paper: Overlapped Trajectory-Enhanced Visual Trackingβ11Sep 3, 2024Updated last year
- β57Mar 10, 2026Updated 3 months ago
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [ECCV 2024] Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performanceβ118Feb 6, 2026Updated 4 months ago
- Similarity-Guided Layer-Adaptive Vision Transformer for UAV Tracking (CVPR 2025)β92Apr 21, 2026Updated last month
- [CVPR 2024] Driving-Video Dehazing with Non-Aligned Regularization for Safety Assistanceβ15Jun 3, 2025Updated last year
- Official implementation of "Group Orthogonal Low-Rank Adaptation for RGB-T Tracking" (AAAI2026)β24Dec 24, 2025Updated 5 months ago
- RGBT Tracking via All-layer Multimodal Interactions with Mambaβ17May 7, 2025Updated last year
- PyTorch implementation of "Efficient Motion Prompt Learning for Robust Visual Tracking" (ICML2025)β31Dec 17, 2025Updated 5 months ago
- The official pytorch implementation of our AAAI 2024 paper "Unifying Visual and Vision-Language Tracking via Contrastive Learning"β50Nov 4, 2024Updated last year
- PaperCode: Unified Single-Stage Transformer Network for Efficient RGB-T TrackingοΌAccepted by IJCAI2024οΌβ11Sep 24, 2024Updated last year
- #ICCV, #MoE, #Trackingβ38Jul 11, 2025Updated 11 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- β106Dec 17, 2024Updated last year
- β121Jan 8, 2025Updated last year
- A survery for multi-modal visual object tracking, including RGBT, RGBD, RGBE, RGBL, RGBNIR, RGBS.β84Apr 5, 2026Updated 2 months ago
- Transactions on Multimedia (TMM25)β21Apr 8, 2025Updated last year
- [CVPR 2024] SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Trackingβ69Jun 30, 2024Updated last year
- β13Jul 15, 2024Updated last year
- [CVPR 2025] Official PyTorch implementation of Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation Abilityβ127May 19, 2026Updated 3 weeks ago