Event-AHU / VehicleMAE
[AAAI-2024] Structural Information Guided Multimodal Pre-training for Vehicle-centric Perception, Xiao Wang, Wentao Wu, Chenglong Li, Zhicheng Zhao, Zhe Chen, Yukai Shi, Jin Tang
☆21Updated 6 months ago
Alternatives and similar repositories for VehicleMAE:
Users that are interested in VehicleMAE are comparing it to the libraries listed below
- MMPD Dataset from ECCV'2024 "When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset"☆15Updated 7 months ago
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆30Updated last year
- Official implementation for "RecursiveDet: End-to-End Region-based Recursive Object Detection" (ICCV 2023)☆16Updated last year
- Video Feature Enhancement with PyTorch☆26Updated 2 months ago
- 🚀【AAAI 2025】Cross-View Referring Multi-Object Tracking☆37Updated last month
- CLIMB-ReID: A Hybrid CLIP-Mamba Framework for Person Re-Identification(AAAI2025)☆16Updated 2 months ago
- ☆19Updated 2 years ago
- ☆13Updated 2 years ago
- [ECCVW 2022] UAD: Localization Uncertainty Estimation for Anchor-Free Object Detection☆15Updated last year
- Zone Evaluation: Revealing Spatial Bias in Object Detection (TPAMI 2024)☆39Updated 2 months ago
- Official Code of CVPR'23 Paper "VLPD: Context-Aware Pedestrian Detection via Vision-Language Semantic Self-Supervision"☆22Updated 10 months ago
- Teach-DETR: Better Training DETR with Teachers☆30Updated 11 months ago
- Fast Template Matching and Update for Video Object Tracking and Segmentation☆25Updated 3 years ago
- [ECCV2024] Official implementation of the paper "When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dat…☆67Updated last month
- source codes of "Summarize and Search: Learning Consensus-aware Dynamic Convolution for Co-Saliency Detection" (ICCV2021)☆17Updated 3 years ago
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Updated last year
- ☆32Updated 2 years ago
- Knowledge Distillation Toolbox for Semantic Segmentation☆17Updated 2 years ago
- TF-CLIP: Learning Text-Free CLIP for Video-Based Person Re-identification (AAAI2024)☆44Updated 10 months ago
- ☆34Updated 2 years ago
- Lightweight Transformer for Multi-modal Tasks☆15Updated 2 years ago
- ☆17Updated 2 years ago
- ☆10Updated last year
- One-to-Few Label Assignment for End-to-End Dense Detection (CVPR2023)☆39Updated last year
- Rotation Equivariant Siamese Networks for Tracking☆26Updated 3 years ago
- ☆40Updated last year
- An implementation of MSSRM method☆11Updated last year
- Unofficial implement of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection☆31Updated 2 years ago