[AAAI 2025] Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking
☆117 · May 18, 2025 · Updated 10 months ago
Alternatives and similar repositories for STTrack
Users who are interested in STTrack are comparing it to the repositories listed below.
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption ☆46 · Jul 5, 2025 · Updated 8 months ago
- ☆24 · Apr 3, 2024 · Updated last year
- CLIP-Guided Object Restoration for Defense Against 3D Point Cloud Backdoor Attacks ☆19 · Nov 28, 2024 · Updated last year
- [AAAI 2025] Pre-Training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose Estimation ☆34 · Jun 3, 2025 · Updated 9 months ago
- CoDi: Subject-Consistent and Pose-Diverse Text-to-Image Generation ☆37 · Aug 1, 2025 · Updated 7 months ago
- [CVPR 2025] Official code of "From Zero to Detail: Deconstructing Ultra-High-Definition Image Restoration from Progressive Spectral Persp… ☆48 · Apr 2, 2025 · Updated 11 months ago
- CVPR24 ☆65 · Aug 4, 2024 · Updated last year
- Bi-directional Adapter for Multi-modal Tracking ☆97 · Mar 19, 2024 · Updated 2 years ago
- Breaking Modality Gap in RGBT Tracking: Coupled Knowledge Distillation ACMMM2024 ☆22 · Oct 16, 2024 · Updated last year
- SGNet: Structure Guided Network via Gradient-Frequency Awareness for Depth Map Super-Resolution (AAAI 2024) ☆120 · Apr 5, 2024 · Updated last year
- TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes ☆94 · Nov 26, 2025 · Updated 3 months ago
- [ICLR 2026] MotionSight's official code implementation. ☆47 · Feb 13, 2026 · Updated last month
- [AAAI2025] SUTrack: Towards Simple and Unified Single Object Tracking ☆132 · Jun 16, 2025 · Updated 9 months ago
- Latest Advances on Autoregressive Visual Models. ☆28 · Mar 15, 2025 · Updated last year
- [AAAI 2025] Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video ☆115 · Jun 14, 2025 · Updated 9 months ago
- The official repository of the paper "DCDepth: Progressive Monocular Depth Estimation in Discrete Cosine Domain" (NeurIPS-2024) ☆39 · Mar 10, 2025 · Updated last year
- ☆25 · Dec 20, 2024 · Updated last year
- The official implementation for the paper [ODTrack: Online Dense Temporal Token Learning for Visual Tracking]. ☆178 · Oct 7, 2024 · Updated last year
- Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction (ICRA 2025) ☆54 · Dec 7, 2025 · Updated 3 months ago
- ☆47 · Mar 10, 2026 · Updated last week
- Similarity-Guided Layer-Adaptive Vision Transformer for UAV Tracking (CVPR 2025) ☆83 · Jul 15, 2025 · Updated 8 months ago
- [ECCV 2024] Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance ☆102 · Feb 6, 2026 · Updated last month
- [CVPR 2024] Driving-Video Dehazing with Non-Aligned Regularization for Safety Assistance ☆15 · Jun 3, 2025 · Updated 9 months ago
- Official implementation of "Group Orthogonal Low-Rank Adaptation for RGB-T Tracking" (AAAI2026) ☆15 · Dec 24, 2025 · Updated 2 months ago
- ☆66 · Nov 10, 2023 · Updated 2 years ago
- PyTorch implementation of "Efficient Motion Prompt Learning for Robust Visual Tracking" (ICML2025) ☆24 · Dec 17, 2025 · Updated 3 months ago
- #ICCV, #MoE, #Tracking ☆33 · Jul 11, 2025 · Updated 8 months ago
- RGBT Tracking via All-layer Multimodal Interactions with Mamba ☆16 · May 7, 2025 · Updated 10 months ago
- The official pytorch implementation of our AAAI 2024 paper "Unifying Visual and Vision-Language Tracking via Contrastive Learning" ☆46 · Nov 4, 2024 · Updated last year
- This is the official repository of UltraHR-100K. ☆46 · Nov 21, 2025 · Updated 4 months ago
- PaperCode: Unified Single-Stage Transformer Network for Efficient RGB-T Tracking (Accepted by IJCAI2024) ☆11 · Sep 24, 2024 · Updated last year
- ☆121 · Jan 8, 2025 · Updated last year
- A survey for multi-modal visual object tracking, including RGBT, RGBD, RGBE, RGBL, RGBNIR, RGBS. ☆75 · Dec 4, 2025 · Updated 3 months ago
- Transactions on Multimedia (TMM25) ☆19 · Apr 8, 2025 · Updated 11 months ago
- Modality-missing RGBT Tracking: Invertible Prompt Learning and High-quality Benchmarks (IJCV2024) ☆23 · Mar 13, 2026 · Updated last week
- ☆13 · Jul 15, 2024 · Updated last year
- Scene Prior Filtering for Depth Map Super-Resolution ☆28 · Dec 1, 2025 · Updated 3 months ago
- [ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement ☆620 · Dec 12, 2025 · Updated 3 months ago
- [CVPR 2025] Learning Occlusion-Robust Vision Transformers for Real-Time UAV Tracking ☆91 · Jun 12, 2025 · Updated 9 months ago