Event-AHU / VehicleMAE
[AAAI-2024] Structural Information Guided Multimodal Pre-training for Vehicle-centric Perception, Xiao Wang, Wentao Wu, Chenglong Li, Zhicheng Zhao, Zhe Chen, Yukai Shi, Jin Tang
☆21Updated 5 months ago
Alternatives and similar repositories for VehicleMAE:
Users that are interested in VehicleMAE are comparing it to the libraries listed below
- Video Feature Enhancement with PyTorch☆25Updated last month
- ☆13Updated 2 years ago
- source codes of "Summarize and Search: Learning Consensus-aware Dynamic Convolution for Co-Saliency Detection" (ICCV2021)☆17Updated 3 years ago
- MMPD Dataset from ECCV'2024 "When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset"☆12Updated 6 months ago
- official code for Dynamic Smooth Label Assignment☆10Updated 2 years ago
- Unifying Visual Perception by Dispersible Points Learning (ECCV 2022)☆51Updated 2 years ago
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆30Updated last year
- ☆19Updated 2 years ago
- ☆32Updated 2 years ago
- [ECCVW 2022] UAD: Localization Uncertainty Estimation for Anchor-Free Object Detection☆15Updated last year
- 🚀【AAAI 2025】Cross-View Referring Multi-Object Tracking☆36Updated 3 weeks ago
- Official implementation for "RecursiveDet: End-to-End Region-based Recursive Object Detection" (ICCV 2023)☆16Updated 11 months ago
- [ECCV2024] Official implementation of the paper "When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dat…☆59Updated 6 months ago
- [ICCV2023] Isomer: Isomerous Transformer for Zero-Shot Video Object Segmentation☆30Updated last year
- Zone Evaluation: Revealing Spatial Bias in Object Detection (TPAMI 2024)☆38Updated last month
- Knowledge Distillation Toolbox for Semantic Segmentation☆17Updated 2 years ago
- An implementation of MSSRM method☆11Updated last year
- CLIMB-ReID: A Hybrid CLIP-Mamba Framework for Person Re-Identification(AAAI2025)☆16Updated last month
- Lightweight Transformer for Multi-modal Tasks☆15Updated 2 years ago
- (IJCV 2024&ACM MM 2021 Oral) Multi-Source Fusion and Automatic Predictor Selection for Zero-Shot Video Object Segmentation☆19Updated 2 years ago
- ☆17Updated 2 years ago
- This is the code for CVPR2022 paper "Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation"☆19Updated last year
- ☆10Updated last month
- Adaptive Split-Fusion Transformer (ICME 2023 Oral)☆15Updated 10 months ago
- Multiple Anchor Learning for Visual Object Detection (CVPR,2020)☆14Updated 3 years ago
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆35Updated last year
- Teach-DETR: Better Training DETR with Teachers☆30Updated 10 months ago
- Official PyTorch implementation of "DECO: Query-Based End-to-End Object Detection with ConvNets"☆42Updated 3 months ago
- One-to-Few Label Assignment for End-to-End Dense Detection (CVPR2023)☆40Updated last year