qianyuzqy / TransVOD_Lite
(TPAMI 2023) TransVOD:End-to-End Video Object Detection with Spatial-Temporal Transformers (implementations of TransVOD Lite).
☆39Updated last year
Alternatives and similar repositories for TransVOD_Lite:
Users that are interested in TransVOD_Lite are comparing it to the libraries listed below
- (TPAMI 2023) TransVOD:End-to-End Video Object Detection with Spatial-Temporal Transformers (implementations of TransVOD++).☆30Updated 2 years ago
- [ECCV 2022] PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object Detection☆35Updated 2 years ago
- CVPR 2023☆60Updated 2 months ago
- Video Feature Enhancement with PyTorch☆28Updated 4 months ago
- The repository is the code for the paper "End-to-End Video Object Detection with Spatial-TemporalTransformers"☆226Updated last year
- Spatio-channel Attention Blocks for Cross-modal Crowd Counting -- Official Pytorch Implementation (ACCV'22, Oral)☆27Updated last year
- TF-CLIP: Learning Text-Free CLIP for Video-Based Person Re-identification (AAAI2024)☆46Updated 11 months ago
- ☆33Updated last year
- the tracking code, dataset and evaluation code of tiny object tracking☆23Updated last month
- The official implementation of our ICCV 2023 paper "Objects do not disappear: Video object detection by single-frame object location anti…☆26Updated last year
- [ECCV2024] Official implementation of the paper "When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dat…☆75Updated last month
- Implicit Motion Handling for Video Camouflaged Object Detection (CVPR 2022)☆65Updated 2 years ago
- Spatial-Temporal Feature Transformation for Video Object Detection, MICCAI2021☆49Updated 2 years ago
- Official implementation for "IoU-Enhanced Attention for End-to-End Task Specific Object Detection"☆18Updated 2 years ago
- RGB-T Crowd Counting from Drone: A Benchmark and MMCCN Network☆35Updated 11 months ago
- Full-Duplex Strategy for Video Object Segmentation, ICCV, 2021.☆66Updated last year
- (IJCV 2024&ACM MM 2021 Oral) Multi-Source Fusion and Automatic Predictor Selection for Zero-Shot Video Object Segmentation☆19Updated 3 years ago
- The official implementation for paper "SparseTT: Visual Tracking with Sparse Transformers"☆56Updated 2 years ago
- Official Implement of CVPR 2021 paper “Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd Count…☆69Updated 3 years ago
- Code release for "Active Teacher for Semi-Supervised Object Detection", CVPR2022☆83Updated 2 years ago
- [WACV-2023] Detection Recovery in Online Multi-Object Tracking with Sparse Graph Tracker☆56Updated 2 years ago
- [AAAI 2023 Oral] Domain-General Crowd Counting in Unseen Scenarios☆32Updated 2 weeks ago
- [CVPR 2023] Adaptive Sparse Pairwise Loss for Object Re-Identification☆57Updated last year
- ☆66Updated 6 months ago
- Official implementation of the paper "Progressive End-to-End Object Detection in Crowded Scenes"☆92Updated 2 years ago
- YOLOX Inference code for MOTRv2☆17Updated 2 years ago
- ☆38Updated last year
- MOT, SOT, and Detection papers☆13Updated 2 years ago
- [CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model☆83Updated last year
- [ICCV 2023] Source code of "Fcaformer: Forward Cross Attention in Hybrid Vision Transformer"☆22Updated last year