Wangbiao2 / R1-TrackLinks
R1-Track: Direct Application of MLLMs to Visual Object Tracking via Reinforcement Learning.
☆67Updated 8 months ago
Alternatives and similar repositories for R1-Track
Users that are interested in R1-Track are comparing it to the libraries listed below
Sorting:
- (ICLR 2025) The official pytorch implementation of "UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic Segmentation"☆34Updated last week
- OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models☆145Updated 9 months ago
- Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos☆303Updated 4 months ago
- [CVPR‘ 2025 ] JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration☆250Updated last month
- The official implementation for the CVPR'2025 paper Dynamic Updates for Language Adaptation in Visual-Language Tracking☆36Updated 10 months ago
- ☆23Updated last year
- R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization☆452Updated last month
- [NeurIPS 2025 Spotlight] StreamForest: Efficient Online Video Understanding with Persistent Event Memory☆125Updated 2 months ago
- [ICLR 2025] TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning☆82Updated 9 months ago
- Robust Tracking via Mamba-based Context-aware Token Learning (AAAI 2025)☆16Updated 2 months ago
- [AAAI2025] SUTrack: Towards Simple and Unified Single Object Tracking☆126Updated 7 months ago
- [NeurIPS 2024] VastTrack: Vast Category Visual Object Tracking☆73Updated 3 months ago
- [ECCV 2024] Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance☆98Updated last month
- Source code of the paper: Overlapped Trajectory-Enhanced Visual Tracking☆11Updated last year
- [CVPR 2024] Unified Multi-Sensor Tracker With One Parameter Set☆65Updated last year
- [NeurIPS'24] MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts☆17Updated last year
- The official pytorch implementation of our AAAI 2024 paper "Unifying Visual and Vision-Language Tracking via Contrastive Learning"☆45Updated last year
- A vision-language tracking paper list, articles related to visual language tracking have been documented.☆42Updated last year
- [AAAI 2025] Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking☆115Updated 8 months ago
- The official implementation for the paper [Towards Unified Token Learning for Vision-Language Tracking].☆23Updated 2 years ago
- PyTorch implementation of "Efficient Motion Prompt Learning for Robust Visual Tracking" (ICML2025)☆22Updated last month
- BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence☆240Updated 7 months ago
- Official implementation of "SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Trackin…☆39Updated 3 months ago
- SeqTrackv2: Unified Sequence-to-Sequence Learning for Single- and Multi-Modal Visual Object Tracking☆87Updated last year
- PiVOT uses a foundational model for online automatic visual prompt refinement to aid tracking.☆15Updated 8 months ago
- 🚀【AAAI 2025】Cross-View Referring Multi-Object Tracking☆71Updated 6 months ago
- [NeurIPS 2024] Repository for the paper "OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking".☆26Updated last year
- CVPR24☆63Updated last year
- ☆22Updated last year
- ☆25Updated last year