983632847 / All-in-OneLinks
All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment
☆18Updated 8 months ago
Alternatives and similar repositories for All-in-One
Users that are interested in All-in-One are comparing it to the libraries listed below
Sorting:
- [NeurIPS'24] MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts☆16Updated last year
- Source code of the paper: Overlapped Trajectory-Enhanced Visual Tracking☆11Updated last year
- ☆19Updated last year
- Official implementation of "SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Trackin…☆30Updated 3 months ago
- The official pytorch implementation of our AAAI 2024 paper "Unifying Visual and Vision-Language Tracking via Contrastive Learning"☆45Updated 11 months ago
- ☆13Updated last year
- The official implementation for the CVPR 2023 paper Joint Visual Grounding and Tracking with Natural Language Specification.☆73Updated 2 years ago
- [NeurIPS 2024] Repository for the paper "OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking".☆24Updated 11 months ago
- The official implementation for the CVPR'2025 paper Dynamic Updates for Language Adaptation in Visual-Language Tracking☆31Updated 6 months ago
- ☆21Updated 9 months ago
- Robust Tracking via Mamba-based Context-aware Token Learning (AAAI 2025)☆16Updated 6 months ago
- [NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion☆94Updated 4 months ago
- A curated list of RGB-Event (RGB-E) Tracking papers, datasets, and projects.☆16Updated last year
- Tracking with Human-Intent Reasoning☆72Updated 11 months ago
- Progressive Language-guided Visual Learning for Multi-Task Visual Grounding☆11Updated 5 months ago
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆56Updated last month
- The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024☆17Updated last year
- The official implementation for the paper [Towards Unified Token Learning for Vision-Language Tracking].☆20Updated last year
- Multi-Granularity Language-Guided Multi-Object Tracking☆23Updated last month
- [CVPR 2024 Accepted] TaskWeave: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection☆26Updated last year
- [ICASSP'25] Enhancing Vision-Language Tracking by Effectively Converting Textual Cues into Visual Cues☆15Updated 9 months ago
- Awesome video instance segmentation papers☆45Updated last month
- This repository is an official implementation of the paper A Simple Baseline for Open-World Tracking via Self-training.☆10Updated last year
- ☆73Updated last year
- [AAAI 2025] Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.☆24Updated 9 months ago
- Official pytorch repository for "TR-DETR: Task-Reciprocal Transformer for Joint Moment Retrieval and Highlight Detection" (AAAI 2024 Pape…☆52Updated 7 months ago
- PiVOT uses a foundational model for online automatic visual prompt refinement to aid tracking.☆13Updated 4 months ago
- [AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints☆38Updated 3 months ago
- ☆41Updated last year
- PyTorch implementation of "Efficient Motion Prompt Learning for Robust Visual Tracking" (ICML2025)☆21Updated 4 months ago