gabfstr / DiffusionTrackLinks
Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking
☆13Updated 2 years ago
Alternatives and similar repositories for DiffusionTrack
Users that are interested in DiffusionTrack are comparing it to the libraries listed below
Sorting:
- Disentangled Pre-training for Human-Object Interaction Detection☆25Updated 3 weeks ago
- Video Feature Enhancement with PyTorch☆31Updated 7 months ago
- LongShortNet for Streaming Perception task.☆13Updated last year
- The official implementation for the CVPR'2025 paper Dynamic Updates for Language Adaptation in Visual-Language Tracking☆28Updated 3 months ago
- ☆14Updated 9 months ago
- Fast and general video object segmentation evaluation.☆32Updated last year
- [TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memory☆17Updated 3 months ago
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆30Updated last year
- ☆26Updated last year
- The benchmark for "Video Object Segmentation in Panoptic Wild Scenes".☆12Updated last year
- [ECCV 2022] Tackling Background Distraction in Video Object Segmentation☆38Updated last month
- Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024]☆14Updated last year
- [ICCV2023] Isomer: Isomerous Transformer for Zero-Shot Video Object Segmentation☆30Updated last year
- [IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation.☆43Updated 6 months ago
- [CVPR'24] Code for Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models☆17Updated 11 months ago
- [NAACL 2024] Z-GMOT: Zero-shot Generic Multiple Object Tracking☆11Updated last year
- ☆27Updated 8 months ago
- [AAAI-2024] Structural Information Guided Multimodal Pre-training for Vehicle-centric Perception, Xiao Wang, Wentao Wu, Chenglong Li, Zhi…☆23Updated 11 months ago
- ☆17Updated 8 months ago
- Open-vocabulary Semantic Segmentation☆33Updated last year
- Official Code of CVPR'23 Paper "VLPD: Context-Aware Pedestrian Detection via Vision-Language Semantic Self-Supervision"☆22Updated last year
- The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024☆16Updated 9 months ago
- [NeurIPS'24] MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts☆16Updated 9 months ago
- [NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.☆21Updated 4 months ago
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆13Updated last year
- ECCV 2024 STMA & CVPR 2024 1st MOSE & 1st VOT Challenge & 1st LSVOS v6☆10Updated 9 months ago
- [ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model☆16Updated 5 months ago
- The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024☆43Updated 7 months ago
- CVPR2022 - Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation☆23Updated 2 years ago
- LP-OVOD: Open-Vocabulary Object Detection by Linear Probing (WACV 2024)☆25Updated 11 months ago