Event-AHU / VehicleMAE
[AAAI-2024] Structural Information Guided Multimodal Pre-training for Vehicle-centric Perception, Xiao Wang, Wentao Wu, Chenglong Li, Zhicheng Zhao, Zhe Chen, Yukai Shi, Jin Tang
☆22 Updated 10 months ago
Alternatives and similar repositories for VehicleMAE
Users who are interested in VehicleMAE are comparing it to the repositories listed below.
- MMPD Dataset from ECCV'2024 "When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset" ☆17 Updated 10 months ago
- Video Feature Enhancement with PyTorch ☆29 Updated 6 months ago
- [ECCV 2024] PyTorch implementation of Rethinking Features-Fused-Pyramid-Neck for Object Detection ☆17 Updated 6 months ago
- ☆19 Updated 2 years ago
- Official Code of CVPR'23 Paper "VLPD: Context-Aware Pedestrian Detection via Vision-Language Semantic Self-Supervision" ☆22 Updated last year
- Official implementation for "RecursiveDet: End-to-End Region-based Recursive Object Detection" (ICCV 2023) ☆17 Updated last year
- An implementation of MSSRM method ☆11 Updated 2 years ago
- ☆32 Updated 2 years ago
- Unifying Visual Perception by Dispersible Points Learning (ECCV 2022) ☆51 Updated 2 years ago
- ☆13 Updated 2 years ago
- Knowledge Distillation Toolbox for Semantic Segmentation ☆17 Updated 2 years ago
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning ☆30 Updated last year
- Zone Evaluation: Revealing Spatial Bias in Object Detection (TPAMI 2024) ☆44 Updated 6 months ago
- Teach-DETR: Better Training DETR with Teachers ☆31 Updated last year
- Rotation Equivariant Siamese Networks for Tracking ☆26 Updated 3 years ago
- [ECCVW 2022] UAD: Localization Uncertainty Estimation for Anchor-Free Object Detection ☆15 Updated last year
- ☆33 Updated 3 years ago
- CLIMB-ReID: A Hybrid CLIP-Mamba Framework for Person Re-Identification (AAAI 2025) ☆19 Updated 3 months ago
- DATE: Dual Assignment for End-to-End Fully Convolutional Object Detection ☆41 Updated last year
- Official code for Dynamic Smooth Label Assignment ☆10 Updated 2 years ago
- ☆18 Updated 2 years ago
- ☆21 Updated 3 years ago
- ☆25 Updated last year
- Lightweight Transformer for Multi-modal Tasks ☆16 Updated 2 years ago
- [ICLR 2024 poster] Efficient Modulation for Vision Networks ☆53 Updated 11 months ago
- ☆23 Updated last year
- Code for ICML 2023 "Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation" ☆37 Updated last year
- ☆31 Updated 3 years ago
- Source code of "Summarize and Search: Learning Consensus-aware Dynamic Convolution for Co-Saliency Detection" (ICCV 2021) ☆17 Updated 3 years ago
- ☆44 Updated 5 months ago