guanxiongsun / vfe.pytorchLinks
Video Feature Enhancement with PyTorch
☆29Updated 6 months ago
Alternatives and similar repositories for vfe.pytorch
Users that are interested in vfe.pytorch are comparing it to the libraries listed below
Sorting:
- [ECCV 2022] PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object Detection☆36Updated 2 years ago
- [ICLR 2024 poster] Efficient Modulation for Vision Networks☆53Updated 11 months ago
- (TPAMI 2023) TransVOD:End-to-End Video Object Detection with Spatial-Temporal Transformers (implementations of TransVOD++).☆31Updated 2 years ago
- The official implementation of our ICCV 2023 paper "Objects do not disappear: Video object detection by single-frame object location anti…☆30Updated last year
- [AAAI-2024] Structural Information Guided Multimodal Pre-training for Vehicle-centric Perception, Xiao Wang, Wentao Wu, Chenglong Li, Zhi…☆22Updated 10 months ago
- [CVPR 2024] Exploring Orthogonality in Open World Object Detection☆47Updated last month
- [ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking☆47Updated 6 months ago
- ☆27Updated last year
- ☆15Updated 10 months ago
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆30Updated last year
- ☆36Updated 2 years ago
- Official implementation for "RecursiveDet: End-to-End Region-based Recursive Object Detection" (ICCV 2023)☆17Updated last year
- 🚀【AAAI 2025】Cross-View Referring Multi-Object Tracking☆55Updated last week
- Official Code of CVPR'23 Paper "VLPD: Context-Aware Pedestrian Detection via Vision-Language Semantic Self-Supervision"☆22Updated last year
- [ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking☆30Updated 8 months ago
- ☆44Updated 5 months ago
- This is the official PyTorch implementation of ASAG (ICCV 2023).☆18Updated last year
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Updated last year
- [ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation☆51Updated last year
- [NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.☆20Updated 3 months ago
- ☆13Updated 5 months ago
- TF-CLIP: Learning Text-Free CLIP for Video-Based Person Re-identification (AAAI2024)☆50Updated last year
- Multi-Granularity Language-Guided Multi-Object Tracking☆17Updated last week
- (NeurIPS 2024) Official repository of paper "Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models"☆27Updated 2 months ago
- [ICCV 2023] Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment☆43Updated last year
- ☆70Updated 8 months ago
- LongShortNet for Streaming Perception task.☆13Updated last year
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆30Updated last year
- ☆13Updated 2 years ago
- ☆13Updated 8 months ago