guanxiongsun / vfe.pytorch
Video Feature Enhancement with PyTorch
☆29Updated 5 months ago
Alternatives and similar repositories for vfe.pytorch
Users who are interested in vfe.pytorch are comparing it to the repositories listed below.
- [ECCV 2022] PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object Detection☆36Updated 2 years ago
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆30Updated last year
- ☆14Updated 9 months ago
- [ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking☆30Updated 8 months ago
- [ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking☆47Updated 5 months ago
- 🚀【AAAI 2025】Cross-View Referring Multi-Object Tracking☆53Updated last month
- Official Code of CVPR'23 Paper "VLPD: Context-Aware Pedestrian Detection via Vision-Language Semantic Self-Supervision"☆22Updated last year
- ☆44Updated last year
- [ICCV 2023] Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment☆43Updated last year
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Updated last year
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆29Updated last year
- ☆12Updated 11 months ago
- ☆44Updated 4 months ago
- The official implementation of our ICCV 2023 paper "Objects do not disappear: Video object detection by single-frame object location anti…☆29Updated last year
- [ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation☆52Updated last year
- [ICLR 2024 poster] Efficient Modulation for Vision Networks☆53Updated 10 months ago
- (TPAMI 2023) TransVOD: End-to-End Video Object Detection with Spatial-Temporal Transformers (implementations of TransVOD++).☆31Updated 2 years ago
- [CVPR 2024] Exploring Orthogonality in Open World Object Detection☆47Updated last week
- [NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.☆19Updated 2 months ago
- [ECCV2024] Official implementation of the paper "When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dat…☆83Updated 2 months ago
- [AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation☆29Updated last year
- [NeurIPS 2023] Type-to-Track: Retrieve Any Object via Prompt-based Tracking☆13Updated last year
- ☆36Updated 2 years ago
- (TPAMI 2023) TransVOD: End-to-End Video Object Detection with Spatial-Temporal Transformers (implementations of TransVOD Lite).☆39Updated last year
- Multi-Granularity Language-Guided Multi-Object Tracking☆17Updated last month
- TF-CLIP: Learning Text-Free CLIP for Video-Based Person Re-identification (AAAI2024)☆50Updated last year
- ☆19Updated 9 months ago
- Code Implementation of "Unsupervised Recognition of Unknown Objects for Open-World Object Detection"☆26Updated last year
- Taming Self-Training for Open-Vocabulary Object Detection, CVPR 2024☆21Updated last year
- Improving Mamba performance on video understanding tasks☆39Updated 6 months ago