ZhangDailing8 / CPDTrack
☆21Updated 10 months ago
Alternatives and similar repositories for CPDTrack
Users who are interested in CPDTrack are comparing it to the repositories listed below.
- [NeurIPS'24] MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts☆16Updated 10 months ago
- Official Implementation of ECCV2024 paper: SLAck☆28Updated 11 months ago
- code for affordance-r1☆24Updated this week
- This is the official implementation of "GvSeg: General and Task-Oriented Video Segmentation" (Accepted at ECCV 2024).☆18Updated last year
- [CVPR2024] DiffusionTrack: Point set Diffusion Model for Visual Object Tracking☆31Updated last week
- [AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets☆37Updated last year
- DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge☆158Updated this week
- [CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmenta…☆49Updated 2 months ago
- ☆81Updated 3 months ago
- Code & Weights for “Learning Robust Anymodal Segmentor with Unimodal and Cross-modal Distillation”☆13Updated 8 months ago
- TrackGPT: Track What You Need in Videos via Text Prompts☆25Updated 2 years ago
- [TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memory☆17Updated 4 months ago
- [ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking☆50Updated 9 months ago
- [NeurIPS 2023 Spotlight] ZoomTrack: Target-aware Non-uniform Resizing for Efficient Visual Tracking☆18Updated last year
- Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces☆79Updated 2 months ago
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping☆22Updated 3 weeks ago
- [ICRA2024] Darkness Clue-Prompted Tracking in Nighttime UAVs☆29Updated last year
- [NeurIPS-2024] The official Implementation of "Instruction-Guided Visual Masking"☆38Updated 9 months ago
- Official implementation of the paper "InSpire: Vision-Language-Action Models with Intrinsic Spatial Reasoning"☆43Updated this week
- ☆24Updated 8 months ago
- [IEEE TCSVT 2025] Event stream based visual object tracking using Mamba/State Space Model☆41Updated last month
- ☆55Updated 6 months ago
- [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning☆46Updated 4 months ago
- Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"☆138Updated last month
- Official code for "Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation"☆51Updated last week
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation☆34Updated 5 months ago
- [NeurIPS 2024] Repository for the paper "OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking".☆23Updated 9 months ago
- [ECCV24] Navigation Instruction Generation with BEV Perception and Large Language Models☆30Updated last year
- LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [Actively Maintained🔥]☆134Updated this week
- The official pytorch implementation of our AAAI 2024 paper "Unifying Visual and Vision-Language Tracking via Contrastive Learning"☆44Updated 9 months ago