Event-AHU / Open_VLTrackLinks
Vision-Language based Visual Object Tracking
☆17Updated last month
Alternatives and similar repositories for Open_VLTrack
Users that are interested in Open_VLTrack are comparing it to the libraries listed below
Sorting:
- [NeurIPS'24] MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts☆16Updated last year
- [NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.☆24Updated 2 months ago
- ☆26Updated last year
- ☆24Updated 4 months ago
- This repository is an official implementation of the paper A Simple Baseline for Open-World Tracking via Self-training.☆10Updated last year
- ☆24Updated 6 months ago
- ☆32Updated last year
- ☆12Updated 6 months ago
- CLIMB-ReID: A Hybrid CLIP-Mamba Framework for Person Re-Identification(AAAI2025)☆29Updated 3 months ago
- [ICASSP'25] Enhancing Vision-Language Tracking by Effectively Converting Textual Cues into Visual Cues☆15Updated 9 months ago
- [ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking☆51Updated 10 months ago
- Video Feature Enhancement with PyTorch☆30Updated 10 months ago
- TrackGPT: Track What You Need in Videos via Text Prompts☆25Updated 2 years ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆67Updated 11 months ago
- [AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints☆38Updated 3 months ago
- ☆17Updated 10 months ago
- ☆19Updated last year
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Updated 2 years ago
- [ICCV 2025] Official code for paper: Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs☆37Updated 3 months ago
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆35Updated last year
- Open-vocabulary Semantic Segmentation☆33Updated last year
- ☆30Updated last year
- ☆13Updated 9 months ago
- [ECCV 2024] Elysium: Exploring Object-level Perception in Videos via MLLM☆82Updated 11 months ago
- ☆10Updated last year
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆30Updated last year
- [NeurIPS 2024] Repository for the paper "OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking".☆24Updated 11 months ago
- [ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding☆23Updated 7 months ago
- [CVPR25] CoLLM: A Large Language Model for Composed Image Retrieval☆28Updated 6 months ago
- [CVPR 2025] LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding☆67Updated 3 months ago